SlideShare ist ein Scribd-Unternehmen logo
1 von 76
© 2017 Versay Solutions
Voice User Interface
Design:
Skills, Actions, And The Future
Crispin Reedy, Versay Solutions
@crispinTX crispinreedy.com
#BigD17
© 2017 Versay Solutions
Voice User Interface
Design:
Skills, Actions, And The Future
Disclaimer: This session was
NOT sponsored by Dominos
© 2017 Versay Solutions
• Voice User Interface Designer
• 15+ years in the field
• Former coder; got interested in UX
• President of the Association for Voice
Interaction Design
• Consultant for Versay Solutions
@crispinTX
crispinreedy.com
© 2017 Versay Solutions
Session Description
• Amazon Skills for Alexa, Google Actions for Home
– Should your company build a conversational
voice interface for one of these systems, and if
so, how?
• What are the differences between a voice user
interface and other types of UIs?
• What types of skills does a VUI designer need?
• What are some best practices for these VUIs?
• You’ll walk away with answers to the questions
“If, Why, and How” you might choose to explore
this interesting new area of design.
© 2017 Versay Solutions
Easy Answer To #1
• If your company is involved in home
automation:
• Mostly likely Yes, and Yesterday
• Although how you do it will depend on your
platform
• More on that later!
• Everyone else
• Let’s keep talking!
© 2017 Versay Solutions
Basic Terms
© 2017 Versay Solutions
Terms & Technologies
•Speech Recognition
•Natural Language Understanding
•Voice Verification (Biometrics)
•Text to Speech
© 2017 Versay Solutions
Speech Recognition “ASR”
“See the cat.”
© 2017 Versay Solutions
Natural Language Understanding
•Extracting meaning from natural text
“Hello, yes,
I’d like to
pay my
water bill.
Can you
help me with
that?
Intent =
BillPay
Entity
(Bill Type) =
Water
© 2017 Versay Solutions
Voice Verification
“My voice is
my password.”
“Authenticated.
Welcome, Mr.
Smith.”
✓
Text To Speech
© 2017 Versay Solutions
Speech Recognition
• Hands-free command /
control
• Dictation
• Input text
• Small form factor
device, etc.
Text To Speech
• Output text dynamically
• Respond to input
• Useful when no
display is available
Natural Language
Understanding
• Necessary for all
language-based input
• Extract meaning
• Parse large volumes of
text
Voice Verification
• Security
ASR
Application
Data
• Sign-In
• Interaction
• Request
• Action
• Meaning
• Access Data
• Output
TTS
NLU
Voice
prints
Verifi-
cation
© 2017 Versay Solutions
Speech Technology
Today
© 2017 Versay Solutions
Speech Agents, Apps, and APIs
Speech Agents:
• Amazon Alexa
• Echo, Dot, Echo Show
• Google Assistant
• Pixel, Android, Google Home, iPhone app
• Apple’s Siri
• iPhone, iPad, MacOS (Sierra), AppleTV
• Microsoft’s Cortana
• Windows 10, Windows Phone, Xbox, iPhone app
• Samsung’s Bixby
• Galaxy S8, Family Hub 2.0 Fridge
© 2017 Versay Solutions
Speech Agents, Apps, and APIs
Speech Agents can be extended with
“Voice Apps”
• Alexa Skills
• Google Actions
• SiriKit
• Cortana SDK
© 2017 Versay Solutions
Speech Agents, Apps, and APIs
Agent capabilities and apps are somewhat
determined by:
• Platform: Device
• Screen, keyboard, phone, mics, etc.
• Environment: Web site, apps that interact with
the agent
• Ecosystem: Underlying connections, technical
partnerships
© 2017 Versay Solutions
Platforms
© 2017 Versay Solutions
Environment
Google “Actions” or
“Apps”
• Curated
• Direct vs.
Conversational
Siri - Works via apps
Order Uber Order Lyft
© 2017 Versay Solutions New York Times
© 2017 Versay Solutions
Speech Agents, Apps, and APIs
APIs: Allow you access to the underlying
technology
• Amazon
• AVS (Alexa Voice Service) Create an “Alexa” on your
own device
• Amazon Lex, Amazon Polly
• Google
• Cloud Speech API
• API.ai
• Apple
• Apple Speech Framework
• Microsoft
• Bing Speech API
Ecobee Smart
Thermostat
© 2017 Versay Solutions
Use Cases
Use Case “Bakeoff” from Tech Insider
•Travel
•Email
•Messaging
•Sports
•Music
•Weather
•Calendar
•Social
• Translation
• Basic tasks
• General knowledge
• Personality
http://www.businessinsider.com/siri-vs-google-assistant-cortana-alexa-2016-11/
© 2017 Versay Solutions
Use Case “Bakeoff” from Tech Insider
• “wildly finicky when it comes to phrasing.”
• “Each assistant still feels like a fragile, thinly veiled
web of loosely connected services — because that's
what they are.”
• “incredibly uncomfortable to speak to an inanimate
thing in public.”
• “In Google Assistant's case, normalizing the need to
call on a brand ("OK Google") whenever you need a
hand is Orwellian.”
• “None of these things
are at a place I could
comfortably call
"good.””
© 2017 Versay Solutions
Personal Assistant vs. Home Assistant
The Google Pixel XL.
Hollis Johnson/Business Insider
Google.com
© 2017 Versay Solutions
Personal Assistant vs. Home Assistant
© 2017 Versay Solutions
Getting Specific With
Alexa
© 2017 Versay Solutions
“Layers” of Alexa
•Alexa Native Capabilities
•Alexa Skills
•Alexa Voice Services
© 2017 Versay Solutions
“Layers” of Alexa
• Alexa Native Capabilities
• Come out of the box
• Require Alexa wake word (can be changed)
• Alexa Skills
• Alexa’s “Extensions” or “Add-Ons”
• Designed for and deployed on Echo Device
• Skills must be downloaded to Echo
• Require Alexa wake word + Skill name
• Alexa Voice Services
• Add Alexa voice control to your own device
© 2017 Versay Solutions
Alexa “Native” Capabilities
Alexa, what’s 3 + 5?
Alexa, set an alarm
for 3 am.
Alexa, set a thirty
second timer.
Alexa, what’s the
weather?
Note:
Mix of TTS &
Pre-Recorded
Audio
Note:
“Hint”
© 2017 Versay Solutions
Design Considerations
•Proactive “Hints”
• Similar to “Hover Help” or “Tool Tip”
• But less avoidable!
• Pro: Can teach user about other capabilities
• Con: Can be annoying!
• Guideline: If used, be sparing
• Develop rules for when and how frequently to
offer
© 2017 Versay Solutions Amazon.com
Native & Skill
Skill
Skill
Skill
Native & Skill
Alexa Skills
© 2017 Versay Solutions Source: David Attwater, EIG Inc.
© 2017 Versay Solutions Amazon.com
Alexa Skills
Amazon.comAmazon.com
© 2017 Versay Solutions
Amazon.com
© 2017 Versay Solutions
Design Considerations
• Invoking Skills:
• Alexa, open Oprah Magazine
• Alexa, order a pizza from Domino’s
• Alexa, ask Cook Reference what’s the
safe temperature for chicken
• Syntax:
Open <skill>
Ask <skill> for (about, to, with, etc.)
<action>
Ask <skill> <question>
Also: Search, Tell, Talk to, Launch, Start,
Resume, Run, Load, Begin
Oprah Magazine
© 2017 Versay Solutions
Design Considerations
•Skills can be “installed” on the
fly
•If the user knows the name of
the skill
•Skills that require account
information will need extra
steps
Cook Reference
Domino’s
© 2017 Versay Solutions
Alexa App + Linking
© 2017 Versay Solutions
Design Considerations
•Managing access to skills may become
difficult or confusing.
© 2017 Versay Solutions
Design Considerations
•Attention (or lack of attention!) to
technical details can become “deal-
killing” part of overall experience
Domino’s
© 2017 Versay Solutions
Really?
Dominos.com
© 2017 Versay Solutions
No
Dominos.com
© 2017 Versay Solutions
Design Considerations
• Confirmation
• What’s the phone number?
• 214-555-1235
• You said 214-555-1235. Is that correct?
• Yes
• Note: System confirmed the phone number but
not the address
• Was the address really correct?
© 2017 Versay Solutions Dominos.com
© 2017 Versay Solutions
Design Considerations
• “Would you like to place your Easy
Order, reorder your most recent
order, or start a new order?”
• If I’m not logged into my account on
the Alexa app, options 1 and 2 don’t
make much sense.
• “Would you like” is ambiguous – could
be used for Yes / No questions or for
multi-item questions
• First part of the sentence runs into the
choices
• Reuse of the word “order” just seems
odd (but may be unavoidable).
• Could have used more pauses (SSML)
Domino’s
© 2017 Versay Solutions
Design Considerations: SSML
• Speech Synthesis Markup Language
• Can control the way your TTS playback sounds
• Very important if your output is mostly TTS
• Which is true of all most platforms
• Should be supported by all types of TTS
engine
• Amazon has platform specific options
• Plan on using it to fine tune your audio output
© 2017 Versay Solutions
New Prompts & SSML Examples
• Note: TTS Samples with SSML created
with Amazon Polly, not Alexa
• “You can: Place your easy order.
Reorder your most recent order. Or,
start a *new* order.”
• You can: <break time="500ms"/>Place your easy order,
<break time="500ms"/> Reorder your <emphasis
level="moderate">most recent</emphasis> order, <break
time="500ms"/> Or, start a <emphasis
level="strong">new</emphasis> order.
• Placing an order, great! Choose from:
My easy order. My most recent. Or,
start a *new* order.
• <speak>Placing an order. <prosody
pitch="high">Great!</prosody> Choose from: My easy
order. My most recent. Or, start a <emphasis> <prosody
pitch="high">new</prosody> </emphasis>
order.</speak>
Domino’s
© 2017 Versay Solutions
Still Trying To Order That Pizza
• Start of the interaction has changed!
• Probably due to login
• “Would you like to place an order, or
track an order?”
• What just happened!!!?
•System was expecting me to say
“Start a new order” and I only said
“New Order.”
Domino’s
© 2017 Versay Solutions
Design Considerations
• Make sure your input grammar covers all
possible logical utterances (what user can say)
• Don’t leave this stuff up to the programmers!
• Provide examples of coverage
• Coverage should match prompts
• Use some kind of markup to show coverage
• [] optional
• () grouping
• | or
• “Would you like to place your Easy Order,
reorder your most recent order, or start a new
order?”
• [place] [my | an] Easy Order
• [reorder] [my] most recent [order]
• [start a] new [order]
© 2017 Versay Solutions
Design Considerations
•Reprompts:
• What do you do when you didn’t
understand what the caller said?
• Probably don’t want to say “Sorry”
• This can be annoying
• But you CAN rephrase the prompt to make it
different
• Using the same prompt gives the user a sense
that something has gone wrong
© 2017 Versay Solutions
Pizza Pizza Pizza
• Hey you didn’t really need to
explain about the phone number
since I saved it but OK….
• Address has been saved to profile,
great!
• And then boom
Domino’s
© 2017 Versay Solutions
With Speech, you need to spend
a lot more time thinking about
what happens when things go
wrong.
© 2017 Versay Solutions
I Didn’t Really Want to Order Pizza
But By Now I Am Hungry
And So Is Somebody Else
• Note “Easy Order” and Credit Card
cannot be set up on the website
unless you’re actually placing an order.
• Give people enough time to talk!
• There’s that grammar coverage issue
again
• Bell pepper = Green pepper
• What synonyms is your user likely to say?
• At some point couldn’t you just give me a list?
• Notice how they screwed up the article +
the item “… adding a parmesan bread
twists”
Meow
Domino’s
© 2017 Versay Solutions
© 2017 Versay Solutions
© 2017 Versay Solutions
Design Considerations
• Confirm and correct
• “Do you want to add anything else?”
• “Yes, I want to add peppers.”
• Disambiguation
• “Olives”
• “Ok, we have two kinds of olives. Black olives, or
green olives.”
• A Voice User Interface design is a time-based
interface
• As a designer concerned with user experience
you’re going to be involved in things (such as
pauses) which may not occur to you
© 2017 Versay Solutions
How Did Google Home Do?
•“OK Google, Order Dominos”
• “There are stores at….”
• Had to go find the right “App Name” online
•“OK Google, Talk to Dominos”
• “You can link to your Domino’s account…”
• Had a terrible time finding the “Google
Apps.”
© 2017 Versay Solutions
How Did Google Home Do?
•Menu worked!
• System did not recognize “Ham” (Should
offer list of ingredients)
• System became very laggy
© 2017 Versay Solutions
How Did Google Home Do?
• Edited for time
• Original was 3:35
• This is 2:15
• Use of “Dom” persona and male voice
• “Hand off”
• Playback of address:
• Alexa: “Eighty seven twenty three”
• Google: “Eight thousand seven hundred twenty
three”
• Same issue with “twists”
• “Your day just got cheesier”
© 2017 Versay Solutions
Design Considerations
•Discoverability
• “OK Google, Order Dominos”
•Persona
• Google Home has more control over the
voice
• Branding considerations – “Dom” name and
male TTS
•Playback of Dynamic Data
• Attention to detail – don’t trust the platform
to do it the way you want it
© 2017 Versay Solutions
Design Considerations
Maintaining State:
•Between dialogs
• “Who is Seth McFarlane?”
• “Seth McFarlane is…”
• “When’s his birthday?”
• “I’m not sure what you’re talking about.”
•From session to session
Oprah Magazine
© 2017 Versay Solutions
Home Automation
•Onboarding issues are very similar to
“Skills,” but there is an additional layer of
complexity
• Companies are working to improve the
experience
• After setup, you get a lot of bang for the
buck
© 2017 Versay Solutions
“Computer,
turn on the
library lights”
© 2017 Versay Solutions
TP Link
© 2017 Versay Solutions Amazon
© 2017 Versay Solutions
Design Considerations: Summary
• Managing access to Skills
(App, Store)
• Managing the Onboarding
Experience
• Discoverability
• Invoking Skills
• Hints
• Confirmation
• Asking Yes/No Questions vs.
Multi-Item Questions
• SSML
• Silences
• Reprompting
• Coverage (prompt vs.
possible input)
• Managing technical
errors
• Timing and Timeouts
• Article matching the
noun
• Confirm and Correct
• Disambiguation
• Persona
• Playback of Dynamic
Data
• Maintaining State
© 2017 Versay Solutions
What Makes a Good VUI Designer?
•Concern with the overall experience
• All of the channels that go into making up
how something happens
•Attention to “small” technical details
• Pauses
• SSML
•Writing skills!
• Dialog, not tech doc
• English majors, screenwriters
© 2017 Versay Solutions
Session Description
• Amazon Skills for Alexa, Google Actions for Home
– Should your company build a conversational
voice interface for one of these systems, and if
so, how?
• What are the differences between a voice user
interface and other types of UIs? ✔
• What types of skills does a VUI designer need? ✔
• What are some best practices for these VUIs? ✔
• You’ll walk away with answers to the questions
“If, Why, and How” you might choose to explore
this interesting new area of design.
✔
© 2017 Versay Solutions
If, Why, How
•What are you trying to build?
•Existing guidelines / research
•User testing is key
• Especially if you’re trying to do something
complicated
© 2017 Versay Solutions
If, Why, How: Beyond Skills
Write an app (skill) for
an agent such as
Google Assistant /
Alexa
Use cloud APIs to add
ASR / NLU to your app /
device / page / gadget
Download software and
use full-featured
capabilities for more robust
recognition on a specific
device
Build your own
© 2017 Versay Solutions
If, Why, How: What’s the Use Case?
•Enabling application
• User can’t do it any other way
• New tasks
•Enhancing application
• User can do it now
• But speech makes it better
• Faster
• Safer
© 2017 Versay Solutions
API-Based
Device-
Based
Roll Your
Own /
Open-
Source
•Flexibility
•Power
•Customization
•Time
•Difficulty
© 2017 Versay Solutions
Existing Guidelines / Research
• Caveat: Best practices evolved in one
modality (e.g. voice-only) may not apply the
same way in another (e.g. combined voice +
touch)
• But they could be adapted
• Association for Voice Interaction Design
(AVIxD.org)
• Wiki
• Peer-Reviewed Journal
• Virtual “Brown Bags”
• Academic Sources, Books
© 2017 Versay Solutions
AVIxD.org
CUI Working Group is actively recruiting!
© 2017 Versay Solutions
@crispinTX
Crispin Reedy
Thank You!

Weitere ähnliche Inhalte

Was ist angesagt?

Speech to text conversion
Speech to text conversionSpeech to text conversion
Speech to text conversionankit_saluja
 
Speech to Speech real time translations, Aigars Macins, Skype
Speech to Speech real time translations, Aigars Macins, SkypeSpeech to Speech real time translations, Aigars Macins, Skype
Speech to Speech real time translations, Aigars Macins, SkypeTAUS - The Language Data Network
 
Speech To Sign Language Interpreter System
Speech To Sign Language Interpreter SystemSpeech To Sign Language Interpreter System
Speech To Sign Language Interpreter Systemkkkseld
 
SIXTH SENSE TECHNOLOGY REPORT
SIXTH SENSE TECHNOLOGY REPORTSIXTH SENSE TECHNOLOGY REPORT
SIXTH SENSE TECHNOLOGY REPORTJISMI JACOB
 
generative AI in healthcare.pdf
generative AI in healthcare.pdfgenerative AI in healthcare.pdf
generative AI in healthcare.pdfJamieDornan2
 
Voice morphing ppt
Voice morphing pptVoice morphing ppt
Voice morphing ppthimadrigupta
 
Gesture Recogntion Technology
Gesture Recogntion TechnologyGesture Recogntion Technology
Gesture Recogntion TechnologyMohit Sipani
 
Speech recognition techniques
Speech recognition techniquesSpeech recognition techniques
Speech recognition techniquessonukumar142
 
ppt of gesture recognition
ppt of gesture recognitionppt of gesture recognition
ppt of gesture recognitionAayush Agrawal
 
Gesture Recognition Technology
Gesture Recognition TechnologyGesture Recognition Technology
Gesture Recognition TechnologyMuhammad Zeeshan
 
Wearable Computing and Human Computer Interfaces
Wearable Computing and Human Computer InterfacesWearable Computing and Human Computer Interfaces
Wearable Computing and Human Computer InterfacesJeffrey Funk
 
Sign language translator ieee power point
Sign language translator ieee power pointSign language translator ieee power point
Sign language translator ieee power pointMadhuri Yellapu
 
Sixth sense technology
Sixth sense technologySixth sense technology
Sixth sense technologyJai Rabindra
 
Sign Language Recognition based on Hands symbols Classification
Sign Language Recognition based on Hands symbols ClassificationSign Language Recognition based on Hands symbols Classification
Sign Language Recognition based on Hands symbols ClassificationTriloki Gupta
 
screen less display documentation
screen less display documentationscreen less display documentation
screen less display documentationmani akuthota
 
Hand Gesture Recognition Applications
Hand Gesture Recognition ApplicationsHand Gesture Recognition Applications
Hand Gesture Recognition ApplicationsImon_Barua
 
Chatbot Artificial Intelligence
Chatbot Artificial IntelligenceChatbot Artificial Intelligence
Chatbot Artificial IntelligenceMd. Mahedi Mahfuj
 

Was ist angesagt? (20)

Speech to text conversion
Speech to text conversionSpeech to text conversion
Speech to text conversion
 
Speech to Speech real time translations, Aigars Macins, Skype
Speech to Speech real time translations, Aigars Macins, SkypeSpeech to Speech real time translations, Aigars Macins, Skype
Speech to Speech real time translations, Aigars Macins, Skype
 
Speech To Sign Language Interpreter System
Speech To Sign Language Interpreter SystemSpeech To Sign Language Interpreter System
Speech To Sign Language Interpreter System
 
SIXTH SENSE TECHNOLOGY REPORT
SIXTH SENSE TECHNOLOGY REPORTSIXTH SENSE TECHNOLOGY REPORT
SIXTH SENSE TECHNOLOGY REPORT
 
Agent Oriented Programming PPT
Agent Oriented Programming PPTAgent Oriented Programming PPT
Agent Oriented Programming PPT
 
generative AI in healthcare.pdf
generative AI in healthcare.pdfgenerative AI in healthcare.pdf
generative AI in healthcare.pdf
 
Voice morphing ppt
Voice morphing pptVoice morphing ppt
Voice morphing ppt
 
Gesture Recogntion Technology
Gesture Recogntion TechnologyGesture Recogntion Technology
Gesture Recogntion Technology
 
Speech recognition techniques
Speech recognition techniquesSpeech recognition techniques
Speech recognition techniques
 
Blue eye technology ppt
Blue eye technology pptBlue eye technology ppt
Blue eye technology ppt
 
ppt of gesture recognition
ppt of gesture recognitionppt of gesture recognition
ppt of gesture recognition
 
Gesture Recognition Technology
Gesture Recognition TechnologyGesture Recognition Technology
Gesture Recognition Technology
 
Wearable Computing and Human Computer Interfaces
Wearable Computing and Human Computer InterfacesWearable Computing and Human Computer Interfaces
Wearable Computing and Human Computer Interfaces
 
Sign language translator ieee power point
Sign language translator ieee power pointSign language translator ieee power point
Sign language translator ieee power point
 
Sixth sense technology
Sixth sense technologySixth sense technology
Sixth sense technology
 
Sign Language Recognition based on Hands symbols Classification
Sign Language Recognition based on Hands symbols ClassificationSign Language Recognition based on Hands symbols Classification
Sign Language Recognition based on Hands symbols Classification
 
Voice Enabled Chat Bots
Voice Enabled Chat BotsVoice Enabled Chat Bots
Voice Enabled Chat Bots
 
screen less display documentation
screen less display documentationscreen less display documentation
screen less display documentation
 
Hand Gesture Recognition Applications
Hand Gesture Recognition ApplicationsHand Gesture Recognition Applications
Hand Gesture Recognition Applications
 
Chatbot Artificial Intelligence
Chatbot Artificial IntelligenceChatbot Artificial Intelligence
Chatbot Artificial Intelligence
 

Ähnlich wie Voice Interface Design: Skills, Actions, Tips for Building Voice Apps

Conversational User Interfaces, Past and Future
Conversational User Interfaces, Past and FutureConversational User Interfaces, Past and Future
Conversational User Interfaces, Past and FutureCrispin Reedy
 
DEV206_Life of a Code Change to a Tier 1 Service
DEV206_Life of a Code Change to a Tier 1 ServiceDEV206_Life of a Code Change to a Tier 1 Service
DEV206_Life of a Code Change to a Tier 1 ServiceAmazon Web Services
 
MCL301_Building a Voice-Enabled Customer Service Chatbot Using Amazon Lex and...
MCL301_Building a Voice-Enabled Customer Service Chatbot Using Amazon Lex and...MCL301_Building a Voice-Enabled Customer Service Chatbot Using Amazon Lex and...
MCL301_Building a Voice-Enabled Customer Service Chatbot Using Amazon Lex and...Amazon Web Services
 
Alexa State of the Science - ALX321 - 2h amazonwebservices Deep Dive into Ama...
Alexa State of the Science - ALX321 - 2h amazonwebservices Deep Dive into Ama...Alexa State of the Science - ALX321 - 2h amazonwebservices Deep Dive into Ama...
Alexa State of the Science - ALX321 - 2h amazonwebservices Deep Dive into Ama...Amazon Web Services
 
NEW LAUNCH! Deploy Alexa In Your Organization with Alexa for Business - BAP20...
NEW LAUNCH! Deploy Alexa In Your Organization with Alexa for Business - BAP20...NEW LAUNCH! Deploy Alexa In Your Organization with Alexa for Business - BAP20...
NEW LAUNCH! Deploy Alexa In Your Organization with Alexa for Business - BAP20...Amazon Web Services
 
ARC214_Addressing Your Business Needs with AWS
ARC214_Addressing Your Business Needs with AWSARC214_Addressing Your Business Needs with AWS
ARC214_Addressing Your Business Needs with AWSAmazon Web Services
 
AWS FSI Symposium 2017 NYC - Amazon AI for FS
AWS FSI Symposium 2017 NYC - Amazon AI for FSAWS FSI Symposium 2017 NYC - Amazon AI for FS
AWS FSI Symposium 2017 NYC - Amazon AI for FSAmazon Web Services
 
NEW LAUNCH! Bring Alexa to Work! Voice-enable Your Organization with Alexa fo...
NEW LAUNCH! Bring Alexa to Work! Voice-enable Your Organization with Alexa fo...NEW LAUNCH! Bring Alexa to Work! Voice-enable Your Organization with Alexa fo...
NEW LAUNCH! Bring Alexa to Work! Voice-enable Your Organization with Alexa fo...Amazon Web Services
 
AWS 機器學習 I ─ 人工智慧 AI
AWS 機器學習 I ─ 人工智慧 AIAWS 機器學習 I ─ 人工智慧 AI
AWS 機器學習 I ─ 人工智慧 AIAmazon Web Services
 
729 Solutions Helps Connect The Dots - Our Services At A Glance
729 Solutions Helps Connect The Dots - Our Services At A Glance729 Solutions Helps Connect The Dots - Our Services At A Glance
729 Solutions Helps Connect The Dots - Our Services At A GlanceIron Mountain
 
Tools of a Successful Force.com Developer
Tools of a Successful Force.com DeveloperTools of a Successful Force.com Developer
Tools of a Successful Force.com DeveloperJoshua Hoskins
 
OpenNTF Webinar, March, 2021
OpenNTF Webinar, March, 2021OpenNTF Webinar, March, 2021
OpenNTF Webinar, March, 2021Howard Greenberg
 
IOT328_Building an AWS IoT-Enabled Drink Dispenser
IOT328_Building an AWS IoT-Enabled Drink DispenserIOT328_Building an AWS IoT-Enabled Drink Dispenser
IOT328_Building an AWS IoT-Enabled Drink DispenserAmazon Web Services
 
IOT328_Building an AWS IoT-Enabled Drink Dispenser
IOT328_Building an AWS IoT-Enabled Drink DispenserIOT328_Building an AWS IoT-Enabled Drink Dispenser
IOT328_Building an AWS IoT-Enabled Drink DispenserAmazon Web Services
 
Building an AWS IoT-Enabled Drink Dispenser - IOT328 - re:Invent 2017
Building an AWS IoT-Enabled Drink Dispenser - IOT328 - re:Invent 2017Building an AWS IoT-Enabled Drink Dispenser - IOT328 - re:Invent 2017
Building an AWS IoT-Enabled Drink Dispenser - IOT328 - re:Invent 2017Amazon Web Services
 
MCL331_Building a Virtual Assistant with Amazon Polly and Amazon Lex Pollexy
MCL331_Building a Virtual Assistant with Amazon Polly and Amazon Lex PollexyMCL331_Building a Virtual Assistant with Amazon Polly and Amazon Lex Pollexy
MCL331_Building a Virtual Assistant with Amazon Polly and Amazon Lex PollexyAmazon Web Services
 
Secrets Of Web Company Profile
Secrets Of Web Company ProfileSecrets Of Web Company Profile
Secrets Of Web Company ProfileSecrets Of Web
 
How to Implement Domain Driven Design in Real Life SDLC
How to Implement Domain Driven Design  in Real Life SDLCHow to Implement Domain Driven Design  in Real Life SDLC
How to Implement Domain Driven Design in Real Life SDLCAbdul Karim
 
Increasing Website Sales & Conversions with Brad Hauck
Increasing Website Sales & Conversions with Brad HauckIncreasing Website Sales & Conversions with Brad Hauck
Increasing Website Sales & Conversions with Brad HauckBrad Hauck
 
Touch Screens | The very expensive mistake
Touch Screens | The very expensive mistakeTouch Screens | The very expensive mistake
Touch Screens | The very expensive mistakeCasey Dubbs
 

Ähnlich wie Voice Interface Design: Skills, Actions, Tips for Building Voice Apps (20)

Conversational User Interfaces, Past and Future
Conversational User Interfaces, Past and FutureConversational User Interfaces, Past and Future
Conversational User Interfaces, Past and Future
 
DEV206_Life of a Code Change to a Tier 1 Service
DEV206_Life of a Code Change to a Tier 1 ServiceDEV206_Life of a Code Change to a Tier 1 Service
DEV206_Life of a Code Change to a Tier 1 Service
 
MCL301_Building a Voice-Enabled Customer Service Chatbot Using Amazon Lex and...
MCL301_Building a Voice-Enabled Customer Service Chatbot Using Amazon Lex and...MCL301_Building a Voice-Enabled Customer Service Chatbot Using Amazon Lex and...
MCL301_Building a Voice-Enabled Customer Service Chatbot Using Amazon Lex and...
 
Alexa State of the Science - ALX321 - 2h amazonwebservices Deep Dive into Ama...
Alexa State of the Science - ALX321 - 2h amazonwebservices Deep Dive into Ama...Alexa State of the Science - ALX321 - 2h amazonwebservices Deep Dive into Ama...
Alexa State of the Science - ALX321 - 2h amazonwebservices Deep Dive into Ama...
 
NEW LAUNCH! Deploy Alexa In Your Organization with Alexa for Business - BAP20...
NEW LAUNCH! Deploy Alexa In Your Organization with Alexa for Business - BAP20...NEW LAUNCH! Deploy Alexa In Your Organization with Alexa for Business - BAP20...
NEW LAUNCH! Deploy Alexa In Your Organization with Alexa for Business - BAP20...
 
ARC214_Addressing Your Business Needs with AWS
ARC214_Addressing Your Business Needs with AWSARC214_Addressing Your Business Needs with AWS
ARC214_Addressing Your Business Needs with AWS
 
AWS FSI Symposium 2017 NYC - Amazon AI for FS
AWS FSI Symposium 2017 NYC - Amazon AI for FSAWS FSI Symposium 2017 NYC - Amazon AI for FS
AWS FSI Symposium 2017 NYC - Amazon AI for FS
 
NEW LAUNCH! Bring Alexa to Work! Voice-enable Your Organization with Alexa fo...
NEW LAUNCH! Bring Alexa to Work! Voice-enable Your Organization with Alexa fo...NEW LAUNCH! Bring Alexa to Work! Voice-enable Your Organization with Alexa fo...
NEW LAUNCH! Bring Alexa to Work! Voice-enable Your Organization with Alexa fo...
 
AWS 機器學習 I ─ 人工智慧 AI
AWS 機器學習 I ─ 人工智慧 AIAWS 機器學習 I ─ 人工智慧 AI
AWS 機器學習 I ─ 人工智慧 AI
 
729 Solutions Helps Connect The Dots - Our Services At A Glance
729 Solutions Helps Connect The Dots - Our Services At A Glance729 Solutions Helps Connect The Dots - Our Services At A Glance
729 Solutions Helps Connect The Dots - Our Services At A Glance
 
Tools of a Successful Force.com Developer
Tools of a Successful Force.com DeveloperTools of a Successful Force.com Developer
Tools of a Successful Force.com Developer
 
OpenNTF Webinar, March, 2021
OpenNTF Webinar, March, 2021OpenNTF Webinar, March, 2021
OpenNTF Webinar, March, 2021
 
IOT328_Building an AWS IoT-Enabled Drink Dispenser
IOT328_Building an AWS IoT-Enabled Drink DispenserIOT328_Building an AWS IoT-Enabled Drink Dispenser
IOT328_Building an AWS IoT-Enabled Drink Dispenser
 
IOT328_Building an AWS IoT-Enabled Drink Dispenser
IOT328_Building an AWS IoT-Enabled Drink DispenserIOT328_Building an AWS IoT-Enabled Drink Dispenser
IOT328_Building an AWS IoT-Enabled Drink Dispenser
 
Building an AWS IoT-Enabled Drink Dispenser - IOT328 - re:Invent 2017
Building an AWS IoT-Enabled Drink Dispenser - IOT328 - re:Invent 2017Building an AWS IoT-Enabled Drink Dispenser - IOT328 - re:Invent 2017
Building an AWS IoT-Enabled Drink Dispenser - IOT328 - re:Invent 2017
 
MCL331_Building a Virtual Assistant with Amazon Polly and Amazon Lex Pollexy
MCL331_Building a Virtual Assistant with Amazon Polly and Amazon Lex PollexyMCL331_Building a Virtual Assistant with Amazon Polly and Amazon Lex Pollexy
MCL331_Building a Virtual Assistant with Amazon Polly and Amazon Lex Pollexy
 
Secrets Of Web Company Profile
Secrets Of Web Company ProfileSecrets Of Web Company Profile
Secrets Of Web Company Profile
 
How to Implement Domain Driven Design in Real Life SDLC
How to Implement Domain Driven Design  in Real Life SDLCHow to Implement Domain Driven Design  in Real Life SDLC
How to Implement Domain Driven Design in Real Life SDLC
 
Increasing Website Sales & Conversions with Brad Hauck
Increasing Website Sales & Conversions with Brad HauckIncreasing Website Sales & Conversions with Brad Hauck
Increasing Website Sales & Conversions with Brad Hauck
 
Touch Screens | The very expensive mistake
Touch Screens | The very expensive mistakeTouch Screens | The very expensive mistake
Touch Screens | The very expensive mistake
 

Mehr von Crispin Reedy

Association for Voice Interaction Design - Annual Meeting 2018
Association for Voice Interaction Design - Annual Meeting 2018Association for Voice Interaction Design - Annual Meeting 2018
Association for Voice Interaction Design - Annual Meeting 2018Crispin Reedy
 
Adding Visuals to Voice Panel - SpeechTEK 2017
Adding Visuals to Voice Panel - SpeechTEK 2017Adding Visuals to Voice Panel - SpeechTEK 2017
Adding Visuals to Voice Panel - SpeechTEK 2017Crispin Reedy
 
Chatbots vs. Voicebots Sunrise Session SpeechTEK 2017-final
Chatbots vs. Voicebots Sunrise Session SpeechTEK 2017-finalChatbots vs. Voicebots Sunrise Session SpeechTEK 2017-final
Chatbots vs. Voicebots Sunrise Session SpeechTEK 2017-finalCrispin Reedy
 
Association for Voice Interaction Design Annual Meeting 2017
Association for Voice Interaction Design Annual Meeting 2017Association for Voice Interaction Design Annual Meeting 2017
Association for Voice Interaction Design Annual Meeting 2017Crispin Reedy
 
Where's Jarvis? The Future of Voice Recognition and Natural Language User In...
Where's Jarvis?  The Future of Voice Recognition and Natural Language User In...Where's Jarvis?  The Future of Voice Recognition and Natural Language User In...
Where's Jarvis? The Future of Voice Recognition and Natural Language User In...Crispin Reedy
 
Voice Recognition and Natural Language - Dallas TechFest 2016
Voice Recognition and Natural Language - Dallas TechFest 2016Voice Recognition and Natural Language - Dallas TechFest 2016
Voice Recognition and Natural Language - Dallas TechFest 2016Crispin Reedy
 
Top 10 Tips for Making Complicated Things Simple
Top 10 Tips for Making Complicated Things SimpleTop 10 Tips for Making Complicated Things Simple
Top 10 Tips for Making Complicated Things SimpleCrispin Reedy
 
Association for Voice Interaction Design Annual Meeting 2016
Association for Voice Interaction Design Annual Meeting 2016Association for Voice Interaction Design Annual Meeting 2016
Association for Voice Interaction Design Annual Meeting 2016Crispin Reedy
 
Going Solo: Design and Productivity Techniques for the Team of One
Going Solo: Design and Productivity Techniques for the Team of OneGoing Solo: Design and Productivity Techniques for the Team of One
Going Solo: Design and Productivity Techniques for the Team of OneCrispin Reedy
 
Service Design and the Omnichannel Experience - SpeechTEK 2015
Service Design and the Omnichannel Experience - SpeechTEK 2015Service Design and the Omnichannel Experience - SpeechTEK 2015
Service Design and the Omnichannel Experience - SpeechTEK 2015Crispin Reedy
 
Association for Voice Interaction Design Annual Meeting 2015
Association for Voice Interaction Design Annual Meeting 2015Association for Voice Interaction Design Annual Meeting 2015
Association for Voice Interaction Design Annual Meeting 2015Crispin Reedy
 
SpeechTEK University Outtakes 2014: Zero Out Strategies
SpeechTEK University Outtakes 2014: Zero Out StrategiesSpeechTEK University Outtakes 2014: Zero Out Strategies
SpeechTEK University Outtakes 2014: Zero Out StrategiesCrispin Reedy
 
2013 Speech TEK - Alphanumeric Recognition Discussion
2013 Speech TEK - Alphanumeric Recognition Discussion2013 Speech TEK - Alphanumeric Recognition Discussion
2013 Speech TEK - Alphanumeric Recognition DiscussionCrispin Reedy
 
Design Thinking Action Lab Exercise 1
Design Thinking Action Lab Exercise 1Design Thinking Action Lab Exercise 1
Design Thinking Action Lab Exercise 1Crispin Reedy
 

Mehr von Crispin Reedy (15)

Association for Voice Interaction Design - Annual Meeting 2018
Association for Voice Interaction Design - Annual Meeting 2018Association for Voice Interaction Design - Annual Meeting 2018
Association for Voice Interaction Design - Annual Meeting 2018
 
Assertive Niceness
Assertive NicenessAssertive Niceness
Assertive Niceness
 
Adding Visuals to Voice Panel - SpeechTEK 2017
Adding Visuals to Voice Panel - SpeechTEK 2017Adding Visuals to Voice Panel - SpeechTEK 2017
Adding Visuals to Voice Panel - SpeechTEK 2017
 
Chatbots vs. Voicebots Sunrise Session SpeechTEK 2017-final
Chatbots vs. Voicebots Sunrise Session SpeechTEK 2017-finalChatbots vs. Voicebots Sunrise Session SpeechTEK 2017-final
Chatbots vs. Voicebots Sunrise Session SpeechTEK 2017-final
 
Association for Voice Interaction Design Annual Meeting 2017
Association for Voice Interaction Design Annual Meeting 2017Association for Voice Interaction Design Annual Meeting 2017
Association for Voice Interaction Design Annual Meeting 2017
 
Where's Jarvis? The Future of Voice Recognition and Natural Language User In...
Where's Jarvis?  The Future of Voice Recognition and Natural Language User In...Where's Jarvis?  The Future of Voice Recognition and Natural Language User In...
Where's Jarvis? The Future of Voice Recognition and Natural Language User In...
 
Voice Recognition and Natural Language - Dallas TechFest 2016
Voice Recognition and Natural Language - Dallas TechFest 2016Voice Recognition and Natural Language - Dallas TechFest 2016
Voice Recognition and Natural Language - Dallas TechFest 2016
 
Top 10 Tips for Making Complicated Things Simple
Top 10 Tips for Making Complicated Things SimpleTop 10 Tips for Making Complicated Things Simple
Top 10 Tips for Making Complicated Things Simple
 
Association for Voice Interaction Design Annual Meeting 2016
Association for Voice Interaction Design Annual Meeting 2016Association for Voice Interaction Design Annual Meeting 2016
Association for Voice Interaction Design Annual Meeting 2016
 
Going Solo: Design and Productivity Techniques for the Team of One
Going Solo: Design and Productivity Techniques for the Team of OneGoing Solo: Design and Productivity Techniques for the Team of One
Going Solo: Design and Productivity Techniques for the Team of One
 
Service Design and the Omnichannel Experience - SpeechTEK 2015
Service Design and the Omnichannel Experience - SpeechTEK 2015Service Design and the Omnichannel Experience - SpeechTEK 2015
Service Design and the Omnichannel Experience - SpeechTEK 2015
 
Association for Voice Interaction Design Annual Meeting 2015
Association for Voice Interaction Design Annual Meeting 2015Association for Voice Interaction Design Annual Meeting 2015
Association for Voice Interaction Design Annual Meeting 2015
 
SpeechTEK University Outtakes 2014: Zero Out Strategies
SpeechTEK University Outtakes 2014: Zero Out StrategiesSpeechTEK University Outtakes 2014: Zero Out Strategies
SpeechTEK University Outtakes 2014: Zero Out Strategies
 
2013 Speech TEK - Alphanumeric Recognition Discussion
2013 Speech TEK - Alphanumeric Recognition Discussion2013 Speech TEK - Alphanumeric Recognition Discussion
2013 Speech TEK - Alphanumeric Recognition Discussion
 
Design Thinking Action Lab Exercise 1
Design Thinking Action Lab Exercise 1Design Thinking Action Lab Exercise 1
Design Thinking Action Lab Exercise 1
 

Kürzlich hochgeladen

Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationRadu Cotescu
 
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...HostedbyConfluent
 
How to Remove Document Management Hurdles with X-Docs?
How to Remove Document Management Hurdles with X-Docs?How to Remove Document Management Hurdles with X-Docs?
How to Remove Document Management Hurdles with X-Docs?XfilesPro
 
My Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationMy Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationRidwan Fadjar
 
Breaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountBreaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountPuma Security, LLC
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreternaman860154
 
IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsEnterprise Knowledge
 
The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxMalak Abu Hammad
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationSafe Software
 
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 3652toLead Limited
 
Maximizing Board Effectiveness 2024 Webinar.pptx
Maximizing Board Effectiveness 2024 Webinar.pptxMaximizing Board Effectiveness 2024 Webinar.pptx
Maximizing Board Effectiveness 2024 Webinar.pptxOnBoard
 
Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101Paola De la Torre
 
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...Neo4j
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking MenDelhi Call girls
 
Pigging Solutions Piggable Sweeping Elbows
Pigging Solutions Piggable Sweeping ElbowsPigging Solutions Piggable Sweeping Elbows
Pigging Solutions Piggable Sweeping ElbowsPigging Solutions
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking MenDelhi Call girls
 
Factors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptxFactors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptxKatpro Technologies
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonetsnaman860154
 
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticsKotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticscarlostorres15106
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdfhans926745
 

Kürzlich hochgeladen (20)

Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organization
 
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
 
How to Remove Document Management Hurdles with X-Docs?
How to Remove Document Management Hurdles with X-Docs?How to Remove Document Management Hurdles with X-Docs?
How to Remove Document Management Hurdles with X-Docs?
 
My Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationMy Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 Presentation
 
Breaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountBreaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path Mount
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreter
 
IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI Solutions
 
The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptx
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
 
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
 
Maximizing Board Effectiveness 2024 Webinar.pptx
Maximizing Board Effectiveness 2024 Webinar.pptxMaximizing Board Effectiveness 2024 Webinar.pptx
Maximizing Board Effectiveness 2024 Webinar.pptx
 
Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101
 
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men
 
Pigging Solutions Piggable Sweeping Elbows
Pigging Solutions Piggable Sweeping ElbowsPigging Solutions Piggable Sweeping Elbows
Pigging Solutions Piggable Sweeping Elbows
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
 
Factors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptxFactors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptx
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonets
 
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticsKotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf
 

Voice Interface Design: Skills, Actions, Tips for Building Voice Apps

  • 1. © 2017 Versay Solutions Voice User Interface Design: Skills, Actions, And The Future Crispin Reedy, Versay Solutions @crispinTX crispinreedy.com #BigD17
  • 2. © 2017 Versay Solutions Voice User Interface Design: Skills, Actions, And The Future Disclaimer: This session was NOT sponsored by Dominos
  • 3. © 2017 Versay Solutions • Voice User Interface Designer • 15+ years in the field • Former coder; got interested in UX • President of the Association for Voice Interaction Design • Consultant for Versay Solutions @crispinTX crispinreedy.com
  • 4. © 2017 Versay Solutions Session Description • Amazon Skills for Alexa, Google Actions for Home – Should your company build a conversational voice interface for one of these systems, and if so, how? • What are the differences between a voice user interface and other types of UIs? • What types of skills does a VUI designer need? • What are some best practices for these VUIs? • You’ll walk away with answers to the questions “If, Why, and How” you might choose to explore this interesting new area of design.
  • 5. © 2017 Versay Solutions Easy Answer To #1 • If your company is involved in home automation: • Mostly likely Yes, and Yesterday • Although how you do it will depend on your platform • More on that later! • Everyone else • Let’s keep talking!
  • 6. © 2017 Versay Solutions Basic Terms
  • 7. © 2017 Versay Solutions Terms & Technologies •Speech Recognition •Natural Language Understanding •Voice Verification (Biometrics) •Text to Speech
  • 8. © 2017 Versay Solutions Speech Recognition “ASR” “See the cat.”
  • 9. © 2017 Versay Solutions Natural Language Understanding •Extracting meaning from natural text “Hello, yes, I’d like to pay my water bill. Can you help me with that? Intent = BillPay Entity (Bill Type) = Water
  • 10. © 2017 Versay Solutions Voice Verification “My voice is my password.” “Authenticated. Welcome, Mr. Smith.” ✓
  • 12. © 2017 Versay Solutions Speech Recognition • Hands-free command / control • Dictation • Input text • Small form factor device, etc. Text To Speech • Output text dynamically • Respond to input • Useful when no display is available Natural Language Understanding • Necessary for all language-based input • Extract meaning • Parse large volumes of text Voice Verification • Security
  • 13. ASR Application Data • Sign-In • Interaction • Request • Action • Meaning • Access Data • Output TTS NLU Voice prints Verifi- cation
  • 14. © 2017 Versay Solutions Speech Technology Today
  • 15. © 2017 Versay Solutions Speech Agents, Apps, and APIs Speech Agents: • Amazon Alexa • Echo, Dot, Echo Show • Google Assistant • Pixel, Android, Google Home, iPhone app • Apple’s Siri • iPhone, iPad, MacOS (Sierra), AppleTV • Microsoft’s Cortana • Windows 10, Windows Phone, Xbox, iPhone app • Samsung’s Bixby • Galaxy S8, Family Hub 2.0 Fridge
  • 16. © 2017 Versay Solutions Speech Agents, Apps, and APIs Speech Agents can be extended with “Voice Apps” • Alexa Skills • Google Actions • SiriKit • Cortana SDK
  • 17. © 2017 Versay Solutions Speech Agents, Apps, and APIs Agent capabilities and apps are somewhat determined by: • Platform: Device • Screen, keyboard, phone, mics, etc. • Environment: Web site, apps that interact with the agent • Ecosystem: Underlying connections, technical partnerships
  • 18. © 2017 Versay Solutions Platforms
  • 19. © 2017 Versay Solutions Environment Google “Actions” or “Apps” • Curated • Direct vs. Conversational Siri - Works via apps Order Uber Order Lyft
  • 20. © 2017 Versay Solutions New York Times
  • 21. © 2017 Versay Solutions Speech Agents, Apps, and APIs APIs: Allow you access to the underlying technology • Amazon • AVS (Alexa Voice Service) Create an “Alexa” on your own device • Amazon Lex, Amazon Polly • Google • Cloud Speech API • API.ai • Apple • Apple Speech Framework • Microsoft • Bing Speech API Ecobee Smart Thermostat
  • 22. © 2017 Versay Solutions Use Cases
  • 23. Use Case “Bakeoff” from Tech Insider •Travel •Email •Messaging •Sports •Music •Weather •Calendar •Social • Translation • Basic tasks • General knowledge • Personality http://www.businessinsider.com/siri-vs-google-assistant-cortana-alexa-2016-11/
  • 24. © 2017 Versay Solutions Use Case “Bakeoff” from Tech Insider • “wildly finicky when it comes to phrasing.” • “Each assistant still feels like a fragile, thinly veiled web of loosely connected services — because that's what they are.” • “incredibly uncomfortable to speak to an inanimate thing in public.” • “In Google Assistant's case, normalizing the need to call on a brand ("OK Google") whenever you need a hand is Orwellian.” • “None of these things are at a place I could comfortably call "good.””
  • 25. © 2017 Versay Solutions Personal Assistant vs. Home Assistant The Google Pixel XL. Hollis Johnson/Business Insider Google.com
  • 26. © 2017 Versay Solutions Personal Assistant vs. Home Assistant
  • 27. © 2017 Versay Solutions Getting Specific With Alexa
  • 28. © 2017 Versay Solutions “Layers” of Alexa •Alexa Native Capabilities •Alexa Skills •Alexa Voice Services
  • 29. © 2017 Versay Solutions “Layers” of Alexa • Alexa Native Capabilities • Come out of the box • Require Alexa wake word (can be changed) • Alexa Skills • Alexa’s “Extensions” or “Add-Ons” • Designed for and deployed on Echo Device • Skills must be downloaded to Echo • Require Alexa wake word + Skill name • Alexa Voice Services • Add Alexa voice control to your own device
  • 30. © 2017 Versay Solutions Alexa “Native” Capabilities Alexa, what’s 3 + 5? Alexa, set an alarm for 3 am. Alexa, set a thirty second timer. Alexa, what’s the weather? Note: Mix of TTS & Pre-Recorded Audio Note: “Hint”
  • 31. © 2017 Versay Solutions Design Considerations •Proactive “Hints” • Similar to “Hover Help” or “Tool Tip” • But less avoidable! • Pro: Can teach user about other capabilities • Con: Can be annoying! • Guideline: If used, be sparing • Develop rules for when and how frequently to offer
  • 32. © 2017 Versay Solutions Amazon.com Native & Skill Skill Skill Skill Native & Skill Alexa Skills
  • 33. © 2017 Versay Solutions Source: David Attwater, EIG Inc.
  • 34. © 2017 Versay Solutions Amazon.com
  • 36. © 2017 Versay Solutions Amazon.com
  • 37. © 2017 Versay Solutions Design Considerations • Invoking Skills: • Alexa, open Oprah Magazine • Alexa, order a pizza from Domino’s • Alexa, ask Cook Reference what’s the safe temperature for chicken • Syntax: Open <skill> Ask <skill> for (about, to, with, etc.) <action> Ask <skill> <question> Also: Search, Tell, Talk to, Launch, Start, Resume, Run, Load, Begin Oprah Magazine
  • 38. © 2017 Versay Solutions Design Considerations •Skills can be “installed” on the fly •If the user knows the name of the skill •Skills that require account information will need extra steps Cook Reference Domino’s
  • 39. © 2017 Versay Solutions Alexa App + Linking
  • 40. © 2017 Versay Solutions Design Considerations •Managing access to skills may become difficult or confusing.
  • 41. © 2017 Versay Solutions Design Considerations •Attention (or lack of attention!) to technical details can become “deal- killing” part of overall experience Domino’s
  • 42. © 2017 Versay Solutions Really? Dominos.com
  • 43. © 2017 Versay Solutions No Dominos.com
  • 44. © 2017 Versay Solutions Design Considerations • Confirmation • What’s the phone number? • 214-555-1235 • You said 214-555-1235. Is that correct? • Yes • Note: System confirmed the phone number but not the address • Was the address really correct?
  • 45. © 2017 Versay Solutions Dominos.com
  • 46. © 2017 Versay Solutions Design Considerations • “Would you like to place your Easy Order, reorder your most recent order, or start a new order?” • If I’m not logged into my account on the Alexa app, options 1 and 2 don’t make much sense. • “Would you like” is ambiguous – could be used for Yes / No questions or for multi-item questions • First part of the sentence runs into the choices • Reuse of the word “order” just seems odd (but may be unavoidable). • Could have used more pauses (SSML) Domino’s
  • 47. © 2017 Versay Solutions Design Considerations: SSML • Speech Synthesis Markup Language • Can control the way your TTS playback sounds • Very important if your output is mostly TTS • Which is true of all most platforms • Should be supported by all types of TTS engine • Amazon has platform specific options • Plan on using it to fine tune your audio output
  • 48. © 2017 Versay Solutions New Prompts & SSML Examples • Note: TTS Samples with SSML created with Amazon Polly, not Alexa • “You can: Place your easy order. Reorder your most recent order. Or, start a *new* order.” • You can: <break time="500ms"/>Place your easy order, <break time="500ms"/> Reorder your <emphasis level="moderate">most recent</emphasis> order, <break time="500ms"/> Or, start a <emphasis level="strong">new</emphasis> order. • Placing an order, great! Choose from: My easy order. My most recent. Or, start a *new* order. • <speak>Placing an order. <prosody pitch="high">Great!</prosody> Choose from: My easy order. My most recent. Or, start a <emphasis> <prosody pitch="high">new</prosody> </emphasis> order.</speak> Domino’s
  • 49. © 2017 Versay Solutions Still Trying To Order That Pizza • Start of the interaction has changed! • Probably due to login • “Would you like to place an order, or track an order?” • What just happened!!!? •System was expecting me to say “Start a new order” and I only said “New Order.” Domino’s
  • 50. © 2017 Versay Solutions Design Considerations • Make sure your input grammar covers all possible logical utterances (what user can say) • Don’t leave this stuff up to the programmers! • Provide examples of coverage • Coverage should match prompts • Use some kind of markup to show coverage • [] optional • () grouping • | or • “Would you like to place your Easy Order, reorder your most recent order, or start a new order?” • [place] [my | an] Easy Order • [reorder] [my] most recent [order] • [start a] new [order]
  • 51. © 2017 Versay Solutions Design Considerations •Reprompts: • What do you do when you didn’t understand what the caller said? • Probably don’t want to say “Sorry” • This can be annoying • But you CAN rephrase the prompt to make it different • Using the same prompt gives the user a sense that something has gone wrong
  • 52. © 2017 Versay Solutions Pizza Pizza Pizza • Hey you didn’t really need to explain about the phone number since I saved it but OK…. • Address has been saved to profile, great! • And then boom Domino’s
  • 53. © 2017 Versay Solutions With Speech, you need to spend a lot more time thinking about what happens when things go wrong.
  • 54. © 2017 Versay Solutions I Didn’t Really Want to Order Pizza But By Now I Am Hungry And So Is Somebody Else • Note “Easy Order” and Credit Card cannot be set up on the website unless you’re actually placing an order. • Give people enough time to talk! • There’s that grammar coverage issue again • Bell pepper = Green pepper • What synonyms is your user likely to say? • At some point couldn’t you just give me a list? • Notice how they screwed up the article + the item “… adding a parmesan bread twists” Meow Domino’s
  • 55. © 2017 Versay Solutions
  • 56. © 2017 Versay Solutions
  • 57. © 2017 Versay Solutions Design Considerations • Confirm and correct • “Do you want to add anything else?” • “Yes, I want to add peppers.” • Disambiguation • “Olives” • “Ok, we have two kinds of olives. Black olives, or green olives.” • A Voice User Interface design is a time-based interface • As a designer concerned with user experience you’re going to be involved in things (such as pauses) which may not occur to you
  • 58. © 2017 Versay Solutions How Did Google Home Do? •“OK Google, Order Dominos” • “There are stores at….” • Had to go find the right “App Name” online •“OK Google, Talk to Dominos” • “You can link to your Domino’s account…” • Had a terrible time finding the “Google Apps.”
  • 59. © 2017 Versay Solutions How Did Google Home Do? •Menu worked! • System did not recognize “Ham” (Should offer list of ingredients) • System became very laggy
  • 60. © 2017 Versay Solutions How Did Google Home Do? • Edited for time • Original was 3:35 • This is 2:15 • Use of “Dom” persona and male voice • “Hand off” • Playback of address: • Alexa: “Eighty seven twenty three” • Google: “Eight thousand seven hundred twenty three” • Same issue with “twists” • “Your day just got cheesier”
  • 61. © 2017 Versay Solutions Design Considerations •Discoverability • “OK Google, Order Dominos” •Persona • Google Home has more control over the voice • Branding considerations – “Dom” name and male TTS •Playback of Dynamic Data • Attention to detail – don’t trust the platform to do it the way you want it
  • 62. © 2017 Versay Solutions Design Considerations Maintaining State: •Between dialogs • “Who is Seth McFarlane?” • “Seth McFarlane is…” • “When’s his birthday?” • “I’m not sure what you’re talking about.” •From session to session Oprah Magazine
  • 63. © 2017 Versay Solutions Home Automation •Onboarding issues are very similar to “Skills,” but there is an additional layer of complexity • Companies are working to improve the experience • After setup, you get a lot of bang for the buck
  • 64. © 2017 Versay Solutions “Computer, turn on the library lights”
  • 65. © 2017 Versay Solutions TP Link
  • 66. © 2017 Versay Solutions Amazon
  • 67. © 2017 Versay Solutions Design Considerations: Summary • Managing access to Skills (App, Store) • Managing the Onboarding Experience • Discoverability • Invoking Skills • Hints • Confirmation • Asking Yes/No Questions vs. Multi-Item Questions • SSML • Silences • Reprompting • Coverage (prompt vs. possible input) • Managing technical errors • Timing and Timeouts • Article matching the noun • Confirm and Correct • Disambiguation • Persona • Playback of Dynamic Data • Maintaining State
  • 68. © 2017 Versay Solutions What Makes a Good VUI Designer? •Concern with the overall experience • All of the channels that go into making up how something happens •Attention to “small” technical details • Pauses • SSML •Writing skills! • Dialog, not tech doc • English majors, screenwriters
  • 69. © 2017 Versay Solutions Session Description • Amazon Skills for Alexa, Google Actions for Home – Should your company build a conversational voice interface for one of these systems, and if so, how? • What are the differences between a voice user interface and other types of UIs? ✔ • What types of skills does a VUI designer need? ✔ • What are some best practices for these VUIs? ✔ • You’ll walk away with answers to the questions “If, Why, and How” you might choose to explore this interesting new area of design. ✔
  • 70. © 2017 Versay Solutions If, Why, How •What are you trying to build? •Existing guidelines / research •User testing is key • Especially if you’re trying to do something complicated
  • 71. © 2017 Versay Solutions If, Why, How: Beyond Skills Write an app (skill) for an agent such as Google Assistant / Alexa Use cloud APIs to add ASR / NLU to your app / device / page / gadget Download software and use full-featured capabilities for more robust recognition on a specific device Build your own
  • 72. © 2017 Versay Solutions If, Why, How: What’s the Use Case? •Enabling application • User can’t do it any other way • New tasks •Enhancing application • User can do it now • But speech makes it better • Faster • Safer
  • 73. © 2017 Versay Solutions API-Based Device- Based Roll Your Own / Open- Source •Flexibility •Power •Customization •Time •Difficulty
  • 74. © 2017 Versay Solutions Existing Guidelines / Research • Caveat: Best practices evolved in one modality (e.g. voice-only) may not apply the same way in another (e.g. combined voice + touch) • But they could be adapted • Association for Voice Interaction Design (AVIxD.org) • Wiki • Peer-Reviewed Journal • Virtual “Brown Bags” • Academic Sources, Books
  • 75. © 2017 Versay Solutions AVIxD.org CUI Working Group is actively recruiting!
  • 76. © 2017 Versay Solutions @crispinTX Crispin Reedy Thank You!

Hinweis der Redaktion

  1. DO NOT FORGET TO BRING THE MINI-SPEAKERS!!!
  2. “Speech to Text” ? Spoken Language – Machine readable format
  3. Not necessarily tied to speech recognition
  4. Also called voiceprints, biometrics, voice authentication, etc. Not going to discuss this one in a lot of detail today but it’s important that you understand the difference between these technologies. Recognizes a person, not necessarily what they are saying. You can have ASR without Voice Verification And vice versa
  5. Human voice talent Hundreds of hours of recording Digitized Phonemes: Concatenated speech synthesis
  6. Alexa, Ask Capitol One What’s my current credit card balance?
  7. What do you need it for? What kind of device will you be running it on? Connectivity? Can you use cloud based ASR? How much control do you need over the application / user interface?
  8. DO NOT FORGET TO BRING THE MINI-SPEAKERS!!!