Slide 1

Slide 1 text

© 2020, Amazon Web Services, Inc. or its Affiliates. Sébastien Stormacq Principal Developer Advocate, AWS @sebsto An Introduction to Conversational Interfaces

Slide 2

Slide 2 text

© 2020, Amazon Web Services, Inc. or its Affiliates.

Slide 3

Slide 3 text

© 2020, Amazon Web Services, Inc. or its Affiliates. 70s 80s 90s 00s Present Graphical User Interface (GUI) Character Mode Voice User Interface (VUI) Mobile Web

Slide 4

Slide 4 text

© 2020, Amazon Web Services, Inc. or its Affiliates. HUMAN ACCURACY 50% 55% 60% 62% 70% 95% ASR accuracy has dramatically increased in the last 4-5 years. This inflection point has created sustained momentum in consumer adoption of voice technology Source: MindMeld

Slide 5

Slide 5 text

© 2020, Amazon Web Services, Inc. or its Affiliates. VOICE REPRESENTS THE NEXT MAJOR DISRUPTION IN COMPUTING

Slide 6

Slide 6 text

© 2020, Amazon Web Services, Inc. or its Affiliates.

Slide 7

Slide 7 text

© 2020, Amazon Web Services, Inc. or its Affiliates. Voice Everywhere AT HOME ON THE GO AT WORK

Slide 8

Slide 8 text

© 2020, Amazon Web Services, Inc. or its Affiliates. “ There’s no future that doesn’t have ambient computing or voice activation ” Mark Cuban, Entrepreneur

Slide 9

Slide 9 text

© 2020, Amazon Web Services, Inc. or its Affiliates. How it works Request Audio Response Your Service Text to Speech Machine Learning Natural Language Understanding Speech Recognition Cards

Slide 10

Slide 10 text

© 2020, Amazon Web Services, Inc. or its Affiliates. Wake word detection Beam forming Cards Your Service Text to Speech Machine Learning Natural Language Understanding Speech Recognition Request Response Signal Processing

Slide 11

Slide 11 text

© 2020, Amazon Web Services, Inc. or its Affiliates. Speech to text Your Service Text to Speech Machine Learning Natural Language Understanding Speech Recognition Response Cards Utterances Intents

Slide 12

Slide 12 text

© 2020, Amazon Web Services, Inc. or its Affiliates. ASR – Automatic Speech Recognition fo ̇ r tē tīmz Forty Times? 40x For Tea Times? For Tee Times?

Slide 13

Slide 13 text

© 2020, Amazon Web Services, Inc. or its Affiliates. Utterances Into Intents Your Service Text to Speech Machine Learning Natural Language Understanding Speech Recognition Response Cards Utterances Intents

Slide 14

Slide 14 text

© 2020, Amazon Web Services, Inc. or its Affiliates. Alexa, wake word S P E A K I N G W I T H A L E X A

Slide 15

Slide 15 text

© 2020, Amazon Web Services, Inc. or its Affiliates. Alexa, ask restaurant finder for breakfast wake word Invocation name launch S P E A K I N G W I T H A L E X A utterance for a good breakfast place to find places to eat in the morning a spot that serves the most important meal of the day

Slide 16

Slide 16 text

© 2020, Amazon Web Services, Inc. or its Affiliates. Utterances Intent Utterances and Intents RecomendationIntent to find places to eat in the morning a good breakfast place where to eat breakfast

Slide 17

Slide 17 text

© 2020, Amazon Web Services, Inc. or its Affiliates. Alexa, open restaurant finder Intents and Slots Recommend a restaurant within four miles slot utterance RecommendationIntent {distance: “4”} slot value intent

Slide 18

Slide 18 text

© 2020, Amazon Web Services, Inc. or its Affiliates. Tell me about Italian food within four miles slot RecommendationIntent {distance: “4”} slot value intent slot {cuisine: “Italian food”} Intents and Slots

Slide 19

Slide 19 text

© 2020, Amazon Web Services, Inc. or its Affiliates. Requests and Responses Your Service Text to Speech Machine Learning Natural Language Understanding Speech Recognition Response Cards Audio Intents

Slide 20

Slide 20 text

© 2020, Amazon Web Services, Inc. or its Affiliates. Speech Output Your Service Text to Speech SSML, streaming audio Machine Learning Natural Language Understanding Speech Recognition Response Cards Audio Intents

Slide 21

Slide 21 text

© 2020, Amazon Web Services, Inc. or its Affiliates. SSML • Speech Synthesis Markup Language • TTS (Text-to-Speech)

Slide 22

Slide 22 text

© 2020, Amazon Web Services, Inc. or its Affiliates. SSML • Pauses well, okay well okay

Slide 23

Slide 23 text

© 2020, Amazon Web Services, Inc. or its Affiliates. SSML • Pronunciation Hints Hello

Slide 24

Slide 24 text

© 2020, Amazon Web Services, Inc. or its Affiliates. SSML • Speechcons Hurray!

Slide 25

Slide 25 text

© 2020, Amazon Web Services, Inc. or its Affiliates. The Brain of Alexa in your Apps Amazon Lex

Slide 26

Slide 26 text

© 2020, Amazon Web Services, Inc. or its Affiliates. Designed for builders; efficient tools to build conversations; scales automatically Enterprise ready: Connect to enterprise systems via SaaS connectors Build once and deploy to multiple platforms Text and speech language understanding: Powered by the same technology as Alexa Amazon Lex: Conversational engine

Slide 27

Slide 27 text

© 2020, Amazon Web Services, Inc. or its Affiliates. Should I develop for Amazon Alexa or Amazon Lex? * Images and video on Echo Show, Echo Spot, and other devices with a screen Amazon Alexa Amazon Lex Apps Skills Bots Medium Voice and Displays* DIY (rendering) Out of the Box Built-in + Skills Nothing Distribution Skill Store DIY (your app or device) Devices Alexa Enabled Devices (Amazon & 3P) Yours (web, mobile, hardware) Reach Alexa Customers Your Customers * Images and video on Echo Show, Echo Spot, and other devices with a screen

Slide 28

Slide 28 text

© 2020, Amazon Web Services, Inc. or its Affiliates. Utterances Utterances are spoken or typed phrases that invoke your intent BookHotel Intents Intents perform an action in response to natural language user input Slots Slots are input data required to fulfill the intent Fulfillment Fulfillment mechanism for your intent . Amazon Lex

Slide 29

Slide 29 text

© 2020, Amazon Web Services, Inc. or its Affiliates. Designing for Conversation

Slide 30

Slide 30 text

© 2020, Amazon Web Services, Inc. or its Affiliates. Check in 4/19/2017 City New York City I would like to book a hotel Sure, which city will you be traveling to? New York City What date do you want to check in? April 19th Slot elicitation

Slide 31

Slide 31 text

© 2020, Amazon Web Services, Inc. or its Affiliates. Simple Declarative Model Slots Prompts Build multi-turn conversations Easy setup in console Sure, which city will you be traveling to? New York City What date do you want to check in? Tomorrow … And how many nights is this for? Dialog management City Check-In Date Check-Out Date Which city will you be traveling to? What date do you want to check in? How many nights is this for? I would like to book a hotel

Slide 32

Slide 32 text

© 2020, Amazon Web Services, Inc. or its Affiliates. Would you prefer to stay downtown this time as well? Yes What date do you want to check in? April 19th Personalize conversation based on user preferences Validate user input and re-prompt as necessary Lambda function Customize conversations Sorry, no availability. Would a different location work? I would like to book a hotel

Slide 33

Slide 33 text

© 2020, Amazon Web Services, Inc. or its Affiliates. Sorry, can you please repeat that? I am having trouble understanding. Can you please say that again? Clarifies by requesting that user repeat Hang-up phrase to end the conversation Error handling Sorry, I am not able to assist you at this time. I want to …. {garbled} ….. Uses a different prompt every time

Slide 34

Slide 34 text

© 2020, Amazon Web Services, Inc. or its Affiliates. Compact • Formatted for messaging platforms • Multiple cards supported • Preview capability • Test in console Rich message formatting Sure. What type of car are you looking for? I would like to rent a car

Slide 35

Slide 35 text

© 2020, Amazon Web Services, Inc. or its Affiliates. AWS Lambda integration Return to client User input is parsed to derive intents and slot values; output is returned to client for further processing Intents and slots are passed to the AWS Lambda function for business logic implementation Fulfillment

Slide 36

Slide 36 text

© 2020, Amazon Web Services, Inc. or its Affiliates. Start with people, not computers Source: BBC, Research & Development, Prototyping for Voice: Methodology – bit.ly/bbc-vui (last accessed August 2019)

Slide 37

Slide 37 text

© 2020, Amazon Web Services, Inc. or its Affiliates. Pay attention to details Plan for escapes Test Be brief #1 #10 #9 #8 #7 #6 Be coherent #5 Offer multiple choices #4 Re-engage conversation #3 Stay natural #2 Be gender neutral Teach customers Tips for great conversations

Slide 38

Slide 38 text

© 2020, Amazon Web Services, Inc. or its Affiliates. Thank you! Sébastien Stormacq Principal Developer Advocate, AWS @sebsto