Understanding User Satisfaction with Intelligent Assistants

Julia Kiseleva, Kyle Williams, Jiepu Jiang, Ahmed Hassan Awadallah,
Aidan C. Crook, Imed Zitouni, Tasos Anastasakos
Eindhoven University of Technology
Pennsylvania State University
University of Massachusetts Amherst
Microsoft
CHIIR’16, Chapel Hill, NC, USA

User’s dialogue with Cortana (task: “Finding a hotel in Chicago”):
Q1: how is the weather in Chicago
Q2: how is it this weekend
Q3: find me hotels
Q4: which one of these is the cheapest
Q5: which one of these has at least 4 stars
Q6: find me directions from the Chicago airport to number one

User’s dialogue with Cortana (task: “Finding a pharmacy”):
Q1: find me a pharmacy nearby
Q2: which of these is highly rated
Q3: show more information about number 2
Q4: how long will it take me to get there
Q5: Thanks

Research Questions
• RQ1: What are characteristic types of scenarios of use?

Controlling Device
• Call a person
• Send a text message
• Check on-device calendar
• Open an application
• Turn on/off wi-fi
• Play music

[Figure: example search result pages composed of a Knowledge Pane, Image Answers, a Location Answer, and Organic Results]

Search Dialogue
User: “Do I need to have a jacket tomorrow?”
Cortana: “You could probably go without one. The forecast shows …”

Search Dialogue
User: “show restaurants near me”
Cortana: “Here are ten restaurants near you”
User: “show the best restaurants near me”
Cortana: “Here are ten restaurants near you that have good reviews”
User: “show directions to the second one”
Cortana: “Getting you direction to the Mayuri Indian Cuisine”

Research Questions
• RQ2: How can we measure different aspects of user satisfaction?
• RQ3: What are key factors determining user satisfaction for the
different scenarios?
• RQ4: How to characterize abandonment in the web search
scenario?
• RQ5: How does query-level satisfaction relate to overall user
satisfaction for the search dialogue scenario?

User Study (RQ2, RQ3)

User Study Participants
• 60 Participants
• Age: 25.53 +/- 5.42 years
• Gender: 75% Male, 25% Female
• Language: 55% English, 45% Other
• Education: 82% Computer Science, 8% Electrical Engineering, 2% Mathematics, 8% Other

User Study Design
• Video Instructions (same for all participants)
• Tasks are realistic – mined from Cortana logs:
o Device control tasks
o Queries where users don’t click
o Search dialogue tasks, mostly location-related queries

Find out what is the hair color of your favorite celebrity.

You are planning a vacation. Pick a place. Check if the weather is good enough for the period you are planning the vacation. Find a hotel that suits you. Find the driving directions to this place.

Questionnaire: Controlling Device
• Were you able to complete the task?
o Yes/No
• How satisfied are you with your experience in this task?
o 5-point Likert scale
• How well did Cortana recognize what you said?
• Did you put in a lot of effort to complete the task?

Controlling Device: 5 tasks, 20 minutes

Questionnaire: Good Abandonment
• Were you able to complete the task?
o Yes/No
• Where did you find the answer?
o Answer Box, Image, SERP, Visited Website
• Which query led you to finding the answer?
o First, Second, Third, >= Fourth
• 5 tasks, 20 minutes

Questionnaire: Search Dialogue
• Were you able to complete the task?
o Yes/No
o If the task has sub-tasks, participants indicate their graded satisfaction, e.g.
o a. How satisfied are you with your experience in finding a hotel?
o b. How satisfied are you with your experience in finding directions?
• 8 tasks: 1 simple, 4 with 2 sub-tasks, 3 with 3 sub-tasks
• 30 minutes

Search Dialog Dataset
• 540 tasks comprising 2,040 queries, of which 1,969 were unique
• The average query length is 7.07
• The simple task generated 130 queries in total
• Tasks with 2 context switches generated 685 queries
• Tasks with 3 context switches generated 1,355 queries
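Statistics like the ones above could be derived from a query log roughly as follows. This is a minimal sketch, not the authors' pipeline: it assumes the log is a list of (task_id, query_text) pairs and that query length is counted in whitespace-separated words (the slides do not state the unit). The function name `query_log_stats` and the tiny sample log, built from queries in the example dialogues, are hypothetical.

```python
from statistics import mean


def query_log_stats(log):
    """Summarize a query log given as (task_id, query_text) pairs."""
    queries = [query for _, query in log]
    return {
        "tasks": len({task_id for task_id, _ in log}),
        "queries": len(queries),
        "unique_queries": len(set(queries)),
        # Assumption: length is the number of whitespace-separated words.
        "avg_query_length": mean(len(query.split()) for query in queries),
    }


# Hypothetical sample log for illustration; not the study's dataset.
sample_log = [
    ("pharmacy", "find me a pharmacy nearby"),
    ("pharmacy", "which of these is highly rated"),
    ("pharmacy", "show more information about number 2"),
    ("hotel", "find me hotels"),
    ("hotel", "find me hotels"),
]

stats = query_log_stats(sample_log)
print(stats)
```

Exact string matching is used for uniqueness here; a real analysis might normalize case or punctuation first.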

Factors Determining Satisfaction
RQ3: What are key factors determining user satisfaction
for the different scenarios?

Results Over Scenarios
[Figure: two bar charts showing the mean Satisfaction Level and the mean Effort (axis 0 to 6), each Across Scenarios and for Device Control, Web Search, and Structured Dialog]
Results ‘Good Abandonment’
RQ4: How to characterize abandonment in the web
search scenario?

[Figure: Results ‘Good Abandonment’. Mean satisfaction level (axis 0 to 6) by the query that led to the answer (First, Second, Third, >= Fourth Query) and by where the answer was found (Answer Box, Image, SERP, Visited Website)]

Search Dialogue Satisfaction
RQ5: How does query-level satisfaction relate to overall
user satisfaction for the structured search dialogue
scenario?

User: “show restaurants near me”
Cortana: “Here are ten restaurants near you”
SAT?
User: “show the best restaurants near me”
Cortana: “Here are ten restaurants near you that have good reviews”
SAT?
User: “show directions to the second one”
Cortana: “Getting you direction to the Mayuri Indian Cuisine”
SAT?
Overall SAT?
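One way to probe the question posed by RQ5 is to aggregate the query-level ratings of each session and check how well each aggregate tracks the overall rating. The sketch below is illustrative only and is not the authors' analysis: the choice of aggregates (mean, minimum, last query), the use of Pearson correlation, the names `pearson` and `compare_aggregates`, and the session data are all assumptions.

```python
from math import sqrt
from statistics import mean


def pearson(xs, ys):
    """Pearson correlation of two equal-length sequences.

    Raises ZeroDivisionError if either sequence has zero variance.
    """
    mx, my = mean(xs), mean(ys)
    cov = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    sx = sqrt(sum((x - mx) ** 2 for x in xs))
    sy = sqrt(sum((y - my) ** 2 for y in ys))
    return cov / (sx * sy)


# Candidate ways to collapse query-level ratings into one number.
AGGREGATES = {
    "mean": mean,
    "min": min,
    "last": lambda ratings: ratings[-1],
}


def compare_aggregates(sessions):
    """Correlate each aggregate of query-level ratings with overall ratings.

    `sessions` is a list of (query_level_ratings, overall_rating) pairs.
    """
    overall = [rating for _, rating in sessions]
    return {
        name: pearson([agg(queries) for queries, _ in sessions], overall)
        for name, agg in AGGREGATES.items()
    }


# Made-up sessions for illustration only; NOT the study's data.
sessions = [
    ([5, 5, 4], 5),
    ([2, 3, 5], 4),
    ([4, 1, 2], 2),
    ([3, 3, 3], 3),
]

correlations = compare_aggregates(sessions)
print(correlations)
```

If no single aggregate correlates strongly with the overall rating, that would be consistent with the view that overall satisfaction depends on the session as a whole and not on any one query.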

Satisfaction Over Different Tasks
[Figure: number of answers at each satisfaction level (1 to 5) for the Weather Task, the Mission Task (2 sub-tasks), and the Mission Task (3 sub-tasks)]

User’s dialogue with Cortana related to the ‘stomach ache’ problem (a combination of scenarios: General Search and Search Dialog):
Q1: what do you have medicine for the stomach ache
Q2: stomach ache medicine over the counter
Q3: show me the nearest pharmacy
Q4: more information on the second one
Q5: do they have a stool softener
Q6: does Fred Meyer have stool softeners

Conclusions (1)
• RQ1: What are characteristic types of scenarios of use?
o We proposed three main types of scenarios of use
• RQ2: How can we measure different aspects of user satisfaction?
o We designed a series of user studies tailored to the three scenarios
• RQ3: What are key factors determining user satisfaction for the different scenarios?
o Effort is a key component of user satisfaction across the different intelligent assistant scenarios

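Conclusions (2)
• RQ4: How to characterize abandonment in the web search scenario?
o We concluded that to measure good abandonment we need to investigate other forms of interaction signals that are not based on clicks or reformulation
• RQ5: How does query-level satisfaction relate to overall user satisfaction for the search dialogue scenario?
o We looked at user satisfaction as ‘a user journey towards an information goal where each step is important,’ and showed the importance of session context on user satisfaction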

Questions?
