Upgrade to Pro — share decks privately, control downloads, hide ads and more …

Building voice enabled experiences with Alexa

Ankita Saloni
September 12, 2021

Building voice enabled experiences with Alexa

Ankita Saloni

September 12, 2021
Tweet

More Decks by Ankita Saloni

Other Decks in Technology

Transcript

  1. 2

  2. Human conversational partners can: • construct meaning based on shared

    context • evolve the data sets upon which they draw • converge on increased understanding of each other's intentions • remember novel information beyond a given conversational session
  3. Amazon Alexa, also known simply as Alexa, is a virtual

    assistant AI technology which can play audio, control your smart home, answer questions, engage your favorite services and much more.
  4. Conversational AI is the set of technologies behind automated messaging

    and speech-enabled applications that offer human-like interactions bet ween computers and humans.
  5. A Voice-User Inter face( VUI) makes spoken human interaction with

    computers possible, using speech recognition to understand spoken commands and answer questions, and t ypically tex t to speech to give a reply.
  6. The Alexa Skills Kit (ASK) is a sof t ware

    development framework that provides self-ser vice APIs and tools which you can use to build Alexa skills. Sof t ware Development Kits (SDKs) are sof t ware development tools and libraries that give you programmatic access to Alexa features for building custom skills.
  7. 70s 80s 90s 00s Present Graphical User Interface (GUI) Character

    Mode Voice User Interface (VUI) Mobile Web
  8. HUMAN ACCURACY 50% 55% 60% 62% 70% 95% ASR accuracy

    has dramatically increased in the last 4-5 years. This inflection point has created sustained momentum in consumer adoption of voice technology Source: MindMeld
  9. Alexa is Growing Fast P R O D U C

    T S W I T H A L E X A B U I L T - I N S U P P O R T E D C O U N T R I E S A L E X A S K I L L S A L E X A D E V E L O P E R S P R O D U C T S T H A T W O R K W I T H A L E X A 1,00,000+ HUNDREDS OF THOUSANDS 150+ 28,000+ 15 Numbers as of Dec 2019
  10. Alexa Skills Kit Request Audio Response Your Service Text to

    Speech Machine Learning Natural Language Understanding Speech Recognition Cards
  11. Wake word detection Beam forming Cards Your Service Text to

    Speech Machine Learning Natural Language Understanding Speech Recognition Request Response Alexa Skills Kit: Signal Processing
  12. Alexa Skills Kit: Speech to text Your Service Text to

    Speech Machine Learning Natural Language Understanding Speech Recognition Response Cards Utterances Intents
  13. fo ̇ r tē tīmz Forty Times? 40x ASR –

    Automatic Speech Recognition
  14. fo ̇ r tē tīmz Forty Times? 40x For Tea

    Times? ASR – Automatic Speech Recognition
  15. fo ̇ r tē tīmz Forty Times? For Tea Times?

    For Tee Times? 40x ASR – Automatic Speech Recognition
  16. fo ̇ r tē tīmz Forty Times? For Tea Times?

    Four Tee Times? 40x ASR – Automatic Speech Recognition
  17. Alexa Skills Kit: Utterances Into Intents Your Service Text to

    Speech Machine Learning Natural Language Understanding Speech Recognition Response Cards Utterances Intents
  18. Alexa, wake word S P E A K I N

    G W I T H A L E X A
  19. Alexa, ask restaurant finder for breakfast wake word Invocation name

    launch S P E A K I N G W I T H A L E X A utterance for a good breakfast place to find places to eat in the morning a spot that serves the most important meal of the day
  20. Utterances Intent Utterances and Intents RecommendationIntent to find places to

    eat in the morning a good breakfast place where to eat breakfast
  21. Alexa, open restaurant finder invocation name Intents and Slots Recommend

    a restaurant within four miles slot utterance RecommendationIntent {distance: “4”} slot value intent launch wake word
  22. Tell me about Italian food within four miles slot RecommendationIntent

    {distance: “4”} slot value intent slot {cuisine: “Italian food”} Intents and Slots
  23. Alexa Skills Kit: Requests and Responses Your Service Text to

    Speech Machine Learning Natural Language Understanding Speech Recognition Response Cards Audio Intents
  24. Alexa Skills Kit: Output Your Service Text to Speech SSML,

    streaming audio Machine Learning Natural Language Understanding Speech Recognition Response Cards Audio Intents
  25. Under the hood On-Device Processing Cloud Processing Wake Word Detection

    Automatic Speech Recognition Natural Language Understanding Skill Text to Speech Signal Processing User: “Alexa, is it hot outside?” Audio Echo: “The weather in Delhi is:” Beamformed Signal Audio Stream Speech to Text Intent: GetWeather Location: Delhi, IN
  26. While designing a skill, ensure that it should be -

    • User friendly • Easy to remember
  27. While designing a skill, ensure that it should be -

    • User friendly • Easy to remember • Personable
  28. Cake Walk is a skill that celebrates your birthday! Tell

    it your birthday to have it count down the days. Interact with the skill on your special day to hear a happy birthday message.
  29. How it all Works Your Service Text to Speech Machine

    Learning Natural Language Understanding Speech Recognition Response Visual Audio Request
  30. How to Host your Backend Resource • Node.js SDK •

    Best with Node.js v8.10 or higher • Python SDK • Java SDK • AWS Lambda function • AWS CloudWatch Logs • Amazon S3 bucket • Amazon IAM role • Amazon DynamoDB table
  31. const handler = { canHandle(handlerInput) { // Returns true if

    the handler can service the request }, handle(handlerInput) { // Code to handle the request } } Understanding the SDK
  32. How to Activate Auto-delegation 1. Define Slots 2. Mark them

    Required 3. Define Prompts 4. Choose a Delegation Strategy
  33. Publishing Skills 1. Describe your skill 2. Provide up to

    three example phrases 3. Provide large and small icon • Must prove you own any IP or copyrighted material
  34. Invest Time in Your Metadata Skill Description for instructions, tips,

    FAQs, context Skill Icon Skill Keywords for discoverability Skill Example Utterances for helping users start out
  35. Leverage the Beta Functionality Give up to 500 users early

    access to your Skill Even just a few users will catch majority of issues – use it!
  36. Alexa Skill Certification Guidelines Amazon will test your skill for:

    • Functionality • Content Guidelines / Policy • Security (HTTPS) • Voice Interface
  37. Common Certification Failures • Incorrectly worded Example phrases • Intent

    response errors • Infringement on Intellectual Property • Invocation name guidelines not met • One word invocation names like “kiddo” not allowed • Doesn’t follow tokenization guidelines e.g., use k. b. c. quiz instead of kbc quiz
  38. alexa.design/AWScredits AWS Promotional Credits Alexa Developer Rewards Monthly Incentive Programs

    alexa.design/india_rewards alexa.design/offers    Developer Incentives
  39. alexa.design/AWScredits • $100 in credits when you publish your first

    skill • Additional $100/month if your skill incurs AWS usage charges 1 AWS Promotional Credits
  40. alexa.design/india_rewards • Developers of skills with the highest customer engagement

    paid monthly • Only Indian developers publishing for the India Alexa Skills Store qualify • 8 eligible categories, engagement calculated based on factors like minutes of usage, new customers, recurring customers, sessions, and more 2 Developer Rewards Program
  41. alexa.design/offers • Build new engaging skills and earn swag every

    month • Bonus Incentive – Earn Sony PlayStation 4 if your new skill published in March gets over n unique customers 3 Monthly Developer Incentives
  42. Customer Profile API SnapTravel KptnCook Recipes Sends SMS to confirm

    travel arrangements Sends Recipe recommendations to email alexa.design/profile-api