Building a voice application for Amazon Alexa requires the Voice First approach. But with the growing device family with displays like the Echo Spot, the Echo Show, or the Fire TV, you are able to support your voice experience with photos, illustrations, or videos. This session concentrates on how to build a Multi-Modal application with Amazon Alexa. We will have a closer look on the best-practices as well as some tools and techniques to help you to create richer voice applications.
10. 10 / 78
Multimodal
Multimodality describes
communication practices in
terms of the textual, aural,
linguistic, spatial, and visual
resources - or modes - used to
compose messages.
Murray, Joddy (2013) / Wikipedia
11. 11 / 78
Textual
Multimodality describes
communication practices in
terms of the textual, aural,
linguistic, spatial, and visual
resources - or modes - used to
compose messages.
Murray, Joddy (2013) / Wikipedia
12. 12 / 78
Aural
Multimodality describes
communication practices in
terms of the textual, aural,
linguistic, spatial, and visual
resources - or modes - used to
compose messages.
Murray, Joddy (2013) / Wikipedia
13. 13 / 78
Linguistic
Multimodality describes
communication practices in
terms of the textual, aural,
linguistic, spatial, and visual
resources - or modes - used to
compose messages.
Murray, Joddy (2013) / Wikipedia
14. 14 / 78
Spatial
Multimodality describes
communication practices in
terms of the textual, aural,
linguistic, spatial, and visual
resources - or modes - used to
compose messages.
Murray, Joddy (2013) / Wikipedia
15. 15 / 78
Visual
Multimodality describes
communication practices in
terms of the textual, aural,
linguistic, spatial, and visual
resources - or modes - used to
compose messages.
Murray, Joddy (2013) / Wikipedia
23. 23 / 78
Echo Buttons
Another input media.
Are they multimodal?
What do you think?
24. 24 / 78
AWS Lambda /
HTTPS Endpoint
Server
Alexa Voice Service
Multimodal
25. 25 / 78
Echo Dot Echo Echo Show Echo Spot
Headless vs. Multimodal
26. 26 / 78
Display devices
Only 5.9 % of Alexa users in
the US own an Echo device
with a display
Voicebot.ai, June 24, 2018, see https://goo.gl/7WSkjD
27. 27 / 78
More numbers
56.2 % own Echo Spot
25.0 % own Echo Show
18.7 % own both devices
Voicebot.ai, June 24, 2018, see https://goo.gl/7WSkjD
28. 28 / 78
Mind the trap!
Many Alexa Skills rather
focus on Echo Show than on
Echo Spot.
More than twice as much
Echo Spot than Echo Show
devices sold.
57. 57 / 78
APL document
JSON file containing list of
packages, resources, layouts,
and styles.
Works like a container and is
send to the device.
58. 58 / 78
APL package
Packages contain APL
documents and images.
Can easily be reused and are
cached on the device.
59. 59 / 78
APL layout
Hierarchy set of components
for rendering one the display.
Can contain text, images,
scrolling regions and even
other layouts.
Can be used to build libraries.
60. 60 / 78
APL resources
Defined constants to be used
for drawing text or images on
the screen.
For example font sizes,
colours or spacing.
61. 61 / 78
APL styles
Collection of grouped
resources to build a style.
Defines size, background
colour, text colour, borders,
etc.
62. 62 / 78
APL components
Components are primitive
types to be added to a layout.
Examples are containers,
text, images, sequences,
scroll views, or touch
wrappers
76. 76 / 78
Start with APL
Public beta phase
You could start today
Consider the numbers of sold
display devices!
77. 77 / 78
Need more
motivation?
Alexa Skills Challenge
$150K in total prizes
Bonus prize for Germany
Enter til 22th of January 2019
https://goo.gl/EETRu5
78. 78 / 78
Any questions?
ralf@travello.audio
https://www.travello.audio