12. On Echo Show
Alexa, where's the nearest swimming pool?
Here are your nearby pools. Which one do
you to see
Alexa, select number two.
here’s more about 〜〜〜
13. What can we do with the screen
background image
Display of long sentences
List interface
audio player controller
video player
touch interface
14. control screen by voice
scroll up / down
scroll right / left
page up / down
next / back
20. body template 2
Single image on right side
Rich or Plain text
Background image
specific entity
show item after user chose from a list
Use case
22. body template 3
Single image on left side
Rich or Plain text
Background image
Scrollable text
specific entity
show item after user chose from a list
Use case
26. list template 1
ordered list
list can include thumbnail
Rich and Plain text
scroll by voice
tap item invoke event
static or dynamic lists
Use case
30. Video app
HLS
H.264
playback control
Title
Subtitle
Seek bar
GUI features
Supported format
Alexa, pause/resume
Alexa, stop/close
Voice control
32. Supported Markup for Text in Display
Templates
Line break <br/>
Bold <b>
Italics <i>
Underline <u>
Font Sizes <font size="2”> 2=28px, 3=32px, 5=48ps, 7=68px
Action <action token="VALUE">clickable text </action>
Inline Images <img src="URL" width="WIDTH" height="HEIGHT" alt="TEXT" />
Use RichText type to use markups.
39. 70% opacity black layer is recommended for
optimal contrast.
↑ Bad Good→
40. Determine the Supported Interfaces for the
Current Device
{
"version": "1.0",
"session": { 〜 },
"context": {
"System": {
"device": {
"deviceId": "amzn1.ask.device.",
"supportedInterfaces": {
"AudioPlayer": {},
"Display": {
"templateVersion": "1.0",
"markupVersion": "1.0"
},
"VideoApp": {}
}
} }
},
"request": { 〜 }
}
The supported interface is listed
in “supportedInterface”.
If the “Display” is not listed in
here, it means the device do
not support display.
41. User may not be near echo
Make the primary content of each template visible and
recognizable from up to 7 feet away.
Customers can use Echo Show from across the kitchen
or even when they cannot easily see it.
Skills need to support these possibilities.
Today I would like to talk about "Designing Skills for Echo Show".
My name is Jun Ichikawa.
I usually work as an infrastructure engineer and I am working at IoT special branch in Japan AWS Users Group.
I write articles about Amazon Alexa on my blog and Qiita.
If you are interested, please take a look.
Recently I made a Facebook group. We share information about Alexa, so if you are interested about alexa please join us.
ok lets start.
First, I would like to introduce echo show.
Amazon released these echo device which use amazon alexa.
Amazon Echo, Echo Tap, Echo dot, Echo look, and Echo Show.
Echo show is the only device which have screen.
I would like you to watch a video of what Echo Show is.
As you saw from the video, the feature of Echo Show is that it has a 7 inch touch screen and camera.
Until now, we interacted with Alexa with only voice, but now that we can watch videos and display search results.
It means, we can now offer a more good user experience.
Designing Skills for Echo Show
When using Amazon Echo to search for a neighborhood pool, echo tell us the name of the first few cases, and echo said "check details with Alexa App".
this means we need to use two device to find out something.
For third party Skill, the information that can be displayed on the Alexa App was about 1 image, title, explanation about information like this.
If Echo Show finds more than one search result, it can be displayed as a list on the screen.
You can scroll with "Alexa, scroll" and paging with "Alexa, next". Of course, you can also operate by swiping the screen or tapping it.
Since it is possible to talk with Echo by voice and visual sense, it is now possible to provide a user experience that was difficult with sound only.
so What can we do with the screen.
we can use background image
Display of long sentences( and it is scrollable)
can use list interface.
audio player controller
video player
touch interface
controlling screen by voice.
scroll up and down
scroll right and left
page up and down
next and back
To include screen displays on Echo Show in their skill, developer must use display templates.
so I want to explain about the display template.
The screen displays and interactions are created with the use of these templates.
body template
list template
this is body template one
body template one have
full width text
rich or plain text
background image
scrollable text
the use case is for showing text only or asking action to user
body template two
body template two have
single image on right side
rich or plain text
background image
Use Body Templates 2 when Alexa’s response is a specific entity (person, place, or thing) or a property of an entity that a user requested directly or chose from a list.
Make sure the user can easily view the title or image from a distance.
body template three
body template three have
single image on left side
rich or plain text
background image
scrollable area
use case is same as body template two. the difference is body template three text area can scroll.
body template six
body template six have
plain text
background image
use case is
Can be used as a welcome screen to offer guidance.
long explanation
Image does not scroll, but the text can.
list template one
list template one have
ordered list with thumbnail
rich and plain text
scroll by voice
tap item invoke event
use case is
- Static or dynamically-generated search results, menu selection, or lists
list template two
list template two have
ordered list
list item must include image
you can scroll by voice
and you can chose item by voice or tapping
Use a horizontal list when you have unique images that help users recognize or choose items from the list.
A horizontal list is also great for books, albums, videos, and products.
video
video app is used for playing video.
supported format is HLS and H.264
video app have
playback controller
title
subtitle
seek bar
video automatically start. and you can control with voice. like “Alexa, pause / resume, stop and close”
supported markups
rich text support some markups.
you can only specify font size 2, 3, 5, 7.
The default font size is 3.
For best control of your display, specify the font size.
use action if you want to add clickable text.
it can be tapped on the Echo Show screen.
action on display
echo show have a touch screen. so you can add a action.
list template items are clickable.
of course you can choose items with you voice.
action tag add clickable word to the body template.
if user chose item by voice, you need to add a sample utterance and intent schema.
but tap event invoke different request with the voice control.
the request type is “display element selected” like this.
display token is used to recognize which screen invoked the event.
and request token used to recognize what item was selected.
if you create a skill with list template or clickable text and use it for chose something.
you need to implement “voice event”, and “tap event” for good user experience.
best practices for echo show skills
if you use image like left side, the text is hard to see.
so 70% opacity black layer is recommended for optimal contrast.
right side image is good to read the text
Determine the Supported Interfaces for the Current Device.
The supported interface is listed in “supportedInterface” .
If the “Display” is not listed in here, it means the device do not support display.
do not include templates in the result.
I created a sample skill to test echo show display.
today I use skill test console.
the test console do not support tap event and voice control. so i can not show you voice control and actions.