SlideShare ist ein Scribd-Unternehmen logo
1 von 44
1
Live Note/QA: http://tinyurl.com/Evorus
Evorus:
A Crowd-Powered
Conversational Assistant
Built to Automate Itself Over Time
Ting-Hao (Kenneth) Huang*
Joseph Chee Chang
Jeffrey P. Bigham
[ Question / Feedback: http://tinyurl.com/Evorus ]
*Kenneth will join in Fall 2018
2
Live Note/QA: http://tinyurl.com/Evorus
3
Live Note/QA: http://tinyurl.com/Evorus
4
Live Note/QA: http://tinyurl.com/Evorus
5
Live Note/QA: http://tinyurl.com/Evorus
6
Live Note/QA: http://tinyurl.com/Evorus
7
Live Note/QA: http://tinyurl.com/Evorus
What just
happened?
• Open Conversation
• Multi-turn interaction
• Multiple domains
• Personalized
• Coherent dialog
• Mix of task-oriented
and social conversation
8
Live Note/QA: http://tinyurl.com/Evorus
Today’s Conversational Assistants…
“What’s new with Alexa?”“Talking to Siri”
9
Live Note/QA: http://tinyurl.com/Evorus
Existing Approaches to
Open Conversation
• Combining multiple task-oriented dialog systems
• DialPort (Zhao, et al., 2016)
• End-to-end framework for dialogue systems
• Serban, et al. 2016; Li, et al. 2017
• Adapting a model to many other domains
• Walker, et al., 2007; Sun, et al., 2016
• Chit-chat systems (social bot)
• Hold social conversations (Banchs, et al., 2012)
• Still a very hard problem…
10
Live Note/QA: http://tinyurl.com/Evorus
Existing Approaches to
Open Conversation
• Combining multiple task-oriented dialog systems
• DialPort (Zhao, et al., 2016)
• End-to-end framework for dialogue systems
• Serban, et al. 2016; Li, et al. 2017
• Adapting a model to many other domains
• Walker, et al., 2007; Sun, et al., 2016
• Chit-chat systems (social bot)
• Hold social conversations (Banchs, et al., 2012)
• Still a very hard problem…
MIT Technology Review
Feb 27, 2018
11
Live Note/QA: http://tinyurl.com/Evorus
A Top-Down Approach
Fully-Automated System
Human-
Powered
System
12
Live Note/QA: http://tinyurl.com/Evorus
A Top-Down Approach
Fully-Automated System
Crowd-AI System
Human-
Powered
System
13
Live Note/QA: http://tinyurl.com/Evorus
A Top-Down Approach
Fully-Automated System
Crowd-AI System
Human-
Powered
System
14
Live Note/QA: http://tinyurl.com/Evorus
In Today’s Talk…
Fully-Automated System
Evorus
Chorus
15
Live Note/QA: http://tinyurl.com/Evorus
Chorus: A Crowd-Powered
Conversation Assistant
Evorus
16
Live Note/QA: http://tinyurl.com/Evorus
•Crowd workers collectively hold a
conversation by:
1. Propose Responses
2. Vote Responses
3. Take Notes
Reward points for
each action
"Is there anything else I can help you with?": Challenges in Deploying an On-Demand
Crowd-Powered Conversational Agent. Huang, et al. HCOMP 2016.
Chorus: A Crowd-Powered
Conversation Assistant
17
Live Note/QA: http://tinyurl.com/Evorus
User Interface
17 / 31
18
Live Note/QA: http://tinyurl.com/Evorus
User & Worker Interface
18 / 31
19
Live Note/QA: http://tinyurl.com/Evorus
20
Live Note/QA: http://tinyurl.com/Evorus
We Deployed Chorus
• Launched on May 20th, 2016
• On Google Hangouts
• 2200+ conversations, 420+ users
• TalkingToTheCrowd.org
21
Live Note/QA: http://tinyurl.com/Evorus
Fully-Automated System
Evorus
Chorus
Evorus: A Crowd-Powered Conversational Assistant
Built to Automate Itself Over Time
22
Live Note/QA: http://tinyurl.com/Evorus
Automating Chorus
23
Live Note/QA: http://tinyurl.com/Evorus
Automating Chorus
24
Live Note/QA: http://tinyurl.com/Evorus
Automating Chorus
25
Live Note/QA: http://tinyurl.com/Evorus
Automating Chorus
26
Live Note/QA: http://tinyurl.com/Evorus
Chatbots
How to select
chatbots
automatically?
27
Live Note/QA: http://tinyurl.com/Evorus
Ranking Chatbots: Performance & Topic
Chatbot’s
Performance
Topic Similarity
Likelihood
of a
Chatbot
28
Live Note/QA: http://tinyurl.com/Evorus
Ranking Chatbots: Performance & Topic
Chatbot’s
Performance
Topic Similarity
Likelihood
of a
Chatbot
~= Overall Message
Acceptance Rate
29
Live Note/QA: http://tinyurl.com/Evorus
Ranking Chatbots: Performance & Topic
Topic Similarity
User Message
Domain of
the Chatbot
Hey what should
I eat in Montreal?
30
Live Note/QA: http://tinyurl.com/Evorus
Ranking Chatbots: Performance & Topic
Topic Similarity
User Message
Domain of
the Chatbot
Hey what should
I eat in Montreal?
Find me some
good restaurants !
Where can I get
Chinese food?
Example
Triggering
Message
31
Live Note/QA: http://tinyurl.com/Evorus
Ranking Chatbots: Performance & Topic
Topic Similarity
User Message
Domain of
the Chatbot
Hey what should
I eat in Montreal?
Example
Triggering
Message
Find me some
good restaurants !
Where can I get
Chinese food?
Topic
Similarity
32
Live Note/QA: http://tinyurl.com/Evorus
Ranking Chatbots: Performance & Topic
Chatbot’s
Performance
Topic Similarity
Likelihood
of a
Chatbot
33
Live Note/QA: http://tinyurl.com/Evorus
Ranking Chatbots: Performance & Topic
Chatbot’s
Performance
Topic Similarity
Likelihood
of a
Chatbot
 Add more chatbots over time !
34
Live Note/QA: http://tinyurl.com/Evorus
35
Live Note/QA: http://tinyurl.com/Evorus
Automatic Upvote
How to estimate
the impact of an
automation?
36
Live Note/QA: http://tinyurl.com/Evorus
Find the Best Confidence Threshold
• High Threshold
• Only vote when pretty sure
• High precision, but little benefit
• Low Threshold
• Nearly always vote
• Grant agreement bonus by mistake
• Damage conversation quality
37
Live Note/QA: http://tinyurl.com/Evorus
Find the Best Confidence Threshold
Expected Reward Points Saved
38
Live Note/QA: http://tinyurl.com/Evorus
Automating Chorus Over Time
39
Live Note/QA: http://tinyurl.com/Evorus
Automating Open Conversations
• Setup
• A 5-month-long deployment, 80 Users
• 4 chatbots + 1 voting bot
• Result
• Automated responses were chosen 12.44% of the time.
• Human upvotes were reduced by 13.81%.
• The cost of each message is reduced by 32.76%.
• Conversation quality and user
satisfaction level remains.
• Conversation Quality: Satisfaction,
Clarity, Responsiveness, Comfort
(Liu, et al., 2010)
40
Live Note/QA: http://tinyurl.com/Evorus
Conclusion & Future Work
• Automating some part of an open conversation is
possible.
• Future Work
• 1000+ Chatbots!
• Deploy Chorus as an Open Research Platform
• Chorus on Smart Devices
41
Live Note/QA: http://tinyurl.com/Evorus
Conclusion & Future Work
• Automating some part of an open conversation is
possible.
• Future Work
• 1000+ Chatbots!
• Deploy Chorus as an Open Research Platform
• Chorus on Smart Devices
@windx0303
KennethHuang.cc
Ting-Hao (Kenneth) Huang
QA: http://tinyurl.com/Evorus
42
Live Note/QA: http://tinyurl.com/Evorus
Backup Slides
43
Live Note/QA: http://tinyurl.com/Evorus
44
Live Note/QA: http://tinyurl.com/Evorus
Features of
Automatic Voting

Weitere ähnliche Inhalte

Ähnlich wie Evorus: A Crowd-Powered Conversational Assistant Built to Automate Itself Over Time

Abraham aranguren. legal and efficient web app testing without permission
Abraham aranguren. legal and efficient web app testing without permissionAbraham aranguren. legal and efficient web app testing without permission
Abraham aranguren. legal and efficient web app testing without permission
Yury Chemerkin
 

Ähnlich wie Evorus: A Crowd-Powered Conversational Assistant Built to Automate Itself Over Time (20)

Using Chatbots in Extension Programming
Using Chatbots in Extension ProgrammingUsing Chatbots in Extension Programming
Using Chatbots in Extension Programming
 
Measure camp tools of the cro rabble
Measure camp   tools of the cro rabbleMeasure camp   tools of the cro rabble
Measure camp tools of the cro rabble
 
Automating web application security testing using Hubot AI / NLP chat bot - B...
Automating web application security testing using Hubot AI / NLP chat bot - B...Automating web application security testing using Hubot AI / NLP chat bot - B...
Automating web application security testing using Hubot AI / NLP chat bot - B...
 
A Case Study of Using Selenium IDE and WebDriver_Presenation
A Case Study of Using Selenium IDE and WebDriver_PresenationA Case Study of Using Selenium IDE and WebDriver_Presenation
A Case Study of Using Selenium IDE and WebDriver_Presenation
 
Open source-secret-sauce-rit-2010
Open source-secret-sauce-rit-2010Open source-secret-sauce-rit-2010
Open source-secret-sauce-rit-2010
 
Abraham aranguren. legal and efficient web app testing without permission
Abraham aranguren. legal and efficient web app testing without permissionAbraham aranguren. legal and efficient web app testing without permission
Abraham aranguren. legal and efficient web app testing without permission
 
Taking your Site from One to One Million Users by Kevin Rose
Taking your Site from One to One Million Users by Kevin RoseTaking your Site from One to One Million Users by Kevin Rose
Taking your Site from One to One Million Users by Kevin Rose
 
The Apache Way
The Apache WayThe Apache Way
The Apache Way
 
Let's Contribute
Let's ContributeLet's Contribute
Let's Contribute
 
How to Use HipChat to Collaborate and Build Culture - Matthew Weinberg
How to Use HipChat to Collaborate and Build Culture - Matthew WeinbergHow to Use HipChat to Collaborate and Build Culture - Matthew Weinberg
How to Use HipChat to Collaborate and Build Culture - Matthew Weinberg
 
Legal and efficient web app testing without permission
Legal and efficient web app testing without permissionLegal and efficient web app testing without permission
Legal and efficient web app testing without permission
 
Introduction to Facebook Messenger, Conversational UI & NLP
Introduction to Facebook Messenger, Conversational UI & NLPIntroduction to Facebook Messenger, Conversational UI & NLP
Introduction to Facebook Messenger, Conversational UI & NLP
 
Open Source Library System Software: Libraries Are Doing it For Themselves
Open Source Library System Software: Libraries Are Doing it For ThemselvesOpen Source Library System Software: Libraries Are Doing it For Themselves
Open Source Library System Software: Libraries Are Doing it For Themselves
 
#ImpactSalesforceSaturday: Einstein bot basic to advanced
#ImpactSalesforceSaturday: Einstein bot basic to advanced#ImpactSalesforceSaturday: Einstein bot basic to advanced
#ImpactSalesforceSaturday: Einstein bot basic to advanced
 
Create Great CNCF User-Base from Lessons Learned from Other Open Source Commu...
Create Great CNCF User-Base from Lessons Learned from Other Open Source Commu...Create Great CNCF User-Base from Lessons Learned from Other Open Source Commu...
Create Great CNCF User-Base from Lessons Learned from Other Open Source Commu...
 
AOD Marketing : Google Analytics Seo Camp 2012
AOD Marketing : Google Analytics Seo Camp 2012AOD Marketing : Google Analytics Seo Camp 2012
AOD Marketing : Google Analytics Seo Camp 2012
 
Contributing to YUI
Contributing to YUIContributing to YUI
Contributing to YUI
 
SEO Exellence with ChatGPT-Webinar Duda
SEO Exellence with ChatGPT-Webinar DudaSEO Exellence with ChatGPT-Webinar Duda
SEO Exellence with ChatGPT-Webinar Duda
 
Using Blockchain to Increase Supply Chain Transparency
Using Blockchain to Increase Supply Chain TransparencyUsing Blockchain to Increase Supply Chain Transparency
Using Blockchain to Increase Supply Chain Transparency
 
Advanced automated visual testing - DrupalCon Global 2020
Advanced automated visual testing - DrupalCon Global 2020Advanced automated visual testing - DrupalCon Global 2020
Advanced automated visual testing - DrupalCon Global 2020
 

Mehr von Ting-Hao Huang

A Crowd-Powered Conversational Assistant That Automates Itself Over Time
A Crowd-Powered Conversational Assistant That Automates Itself Over TimeA Crowd-Powered Conversational Assistant That Automates Itself Over Time
A Crowd-Powered Conversational Assistant That Automates Itself Over Time
Ting-Hao Huang
 

Mehr von Ting-Hao Huang (7)

A 10-Month-Long Deployment Study of On-Demand Recruiting for Low-Latency Crow...
A 10-Month-Long Deployment Study of On-Demand Recruiting for Low-Latency Crow...A 10-Month-Long Deployment Study of On-Demand Recruiting for Low-Latency Crow...
A 10-Month-Long Deployment Study of On-Demand Recruiting for Low-Latency Crow...
 
Real-time On-Demand Crowd-powered Entity Extraction
Real-time On-Demand Crowd-powered Entity ExtractionReal-time On-Demand Crowd-powered Entity Extraction
Real-time On-Demand Crowd-powered Entity Extraction
 
A Crowd-Powered Conversational Assistant That Automates Itself Over Time
A Crowd-Powered Conversational Assistant That Automates Itself Over TimeA Crowd-Powered Conversational Assistant That Automates Itself Over Time
A Crowd-Powered Conversational Assistant That Automates Itself Over Time
 
"Is there anything else I can help you with?": Challenges in Deploying an On-...
"Is there anything else I can help you with?": Challenges in Deploying an On-..."Is there anything else I can help you with?": Challenges in Deploying an On-...
"Is there anything else I can help you with?": Challenges in Deploying an On-...
 
Visual Storytelling (NAACL 2016, Poster)
Visual Storytelling (NAACL 2016, Poster)Visual Storytelling (NAACL 2016, Poster)
Visual Storytelling (NAACL 2016, Poster)
 
Social Metaphor Detection via Topical Analysis
Social Metaphor Detection via Topical AnalysisSocial Metaphor Detection via Topical Analysis
Social Metaphor Detection via Topical Analysis
 
Guardian: A Crowd-Powered Spoken Dialog System for Web APIs
Guardian: A Crowd-Powered Spoken Dialog System for Web APIsGuardian: A Crowd-Powered Spoken Dialog System for Web APIs
Guardian: A Crowd-Powered Spoken Dialog System for Web APIs
 

Kürzlich hochgeladen

+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
?#DUbAI#??##{{(☎️+971_581248768%)**%*]'#abortion pills for sale in dubai@
 
Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire business
panagenda
 

Kürzlich hochgeladen (20)

GenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdfGenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdf
 
AWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of Terraform
 
HTML Injection Attacks: Impact and Mitigation Strategies
HTML Injection Attacks: Impact and Mitigation StrategiesHTML Injection Attacks: Impact and Mitigation Strategies
HTML Injection Attacks: Impact and Mitigation Strategies
 
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot TakeoffStrategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
 
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
 
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUnderstanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
 
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
 
Real Time Object Detection Using Open CV
Real Time Object Detection Using Open CVReal Time Object Detection Using Open CV
Real Time Object Detection Using Open CV
 
Artificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : UncertaintyArtificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : Uncertainty
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processors
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organization
 
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
 
Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire business
 
Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...
 
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, AdobeApidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivity
 
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin WoodPolkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt Robison
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
 
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
 

Evorus: A Crowd-Powered Conversational Assistant Built to Automate Itself Over Time