[VoiceCon] The Voice Technology Landscape

The Voice Technology Landscape A Look Ahead to 2020 &
Beyond

I'm tech lead for Voice & Emerging Platforms at NPR
(National Public Radio) in the USA You can ﬁnd & follow me at @xiehan I AM NARA KASBERGEN Guten Tag!

WHAT IS N.P.R.? A nationwide network of public radio stations

WHY DOES N.P.R. CARE ABOUT VOICE? Then: Now:

The Voice Platforms Team at NPR

MY BIASES I'm a software engineer Consumer app developer Audio
(long-form) "Smart speakers"

WHERE ARE WE GOING?

BOLD PREDICTION By 2030, we'll have built J.A.R.V.I.S.

ELEMENTS NEEDED FOR VOICE UTOPIA (OR, J.A.R.V.I.S.) 1. Better NLU
2. Better AI 3. Ubiquity 4. Connectivity 5. Trust

Better NLU 1

PROBLEM STATEMENT We need machines to understand humans the way
humans naturally speak Wake words & invocation names are clunky The more complex the query, the harder for users to remember the proper order Users want to speak to voice assistants and devices in their native language

PREDICTION BY NARA KASBERGEN (@XIEHAN) DEC 11, 2019 In 2020,
we will see more platform workarounds to avoid requiring invocation names and wake words

EXAMPLE #1: GOOGLE ALARMS You used to have to say
"Hey Google, stop" to stop an alarm that is going off Now, you can just say "stop" (because Google Assistant anticipates that you are going to say something when the alarm starts going off)

EXAMPLE #2: BIXBY N.L. CATEGORIES Example: Rideshare (Uber, Lyft, Taxify,
etc.) The ﬁrst time the user says "Hi Bixby, get me a ride" they choose their preferred app After that, "Hi Bixby, get me a ride" always defaults to that provider (No more need for invocation names, i.e. "Bixby, ask Uber to get me a ride")

Bixby NL Categories

PREDICTION BY NARA KASBERGEN (@XIEHAN) DEC 11, 2019 Other platforms
will copy Bixby's NL categories ASAP

PREDICTION BY NARA KASBERGEN (@XIEHAN) DEC 11, 2019 Bixby could
do well in Europe if and only if Samsung moves fast enough to add more languages

Better AI 2

PROBLEM STATEMENT So far, much of the AI focus in
this space has concentrated on the platform side The platform side is a black box, and at this point that seems unlikely to change Consumer applications need AI in order to provide richer, more helpful, context- dependent responses to user queries

PREDICTION BY NARA KASBERGEN (@XIEHAN) DEC 11, 2019 Free business
idea: "AI as a service" for voice consumer apps

Screenshot of Invocable

A WORD ABOUT VOICE DEV PLATFORMS RIGHT NOW The current
ecosystem is not great There are too many different ways to build skills/actions, not easy to switch back & forth Platforms have focused on a "lowest common denominator" approach, optimizing for non-technical people This approach is holding back the ﬁeld

other companies will copy Samsung's best ideas from the Bixby developer platform

Google might actually ﬁgure out its voice development strategy

Google might actually ﬁgure out its voice development strategy We just might not like it

Google Assistant docs

Ubiquity 3

PROBLEM STATEMENT We need to be able to talk to
our voice assistants anytime, anywhere Too much emphasis has been placed on smart phones The smart home will be the key to moving us closer to this utopia Context-awareness is the key

Amazon will continue to lead the pack by embedding Alexa in anything it can

PREDICTION BY NARA KASBERGEN (@XIEHAN) DEC 11, 2019 Samsung has
the opportunity to quickly gain an edge because of the large numbers of hardware already in people's homes

PREDICTION BY NARA KASBERGEN (@XIEHAN) DEC 11, 2019 Smart fridge
sales are not going to soar in 2020

PREDICTION BY NARA KASBERGEN (@XIEHAN) DEC 11, 2019 Google's Nest/Home
rebranding confusion will hurt it in 2020

PREDICTION BY NARA KASBERGEN (@XIEHAN) DEC 11, 2019 If Apple
doesn't move faster in this area, it's going to fall even further behind

Connectivity 4

PROBLEM STATEMENT Currently, voice technology relies heavily on an Internet
connection because recordings are transcribed and processed in the cloud, and consumer apps also run in the cloud There are two solutions to this: 1. Improve connectivity (faster connections, fewer dead zones) 2. Provide more ofﬂine-ﬁrst possibilities

PREDICTION BY NARA KASBERGEN (@XIEHAN) DEC 11, 2019 The rollout
of 5G will improve connectivity and therefore the usability of voice assistants

PREDICTION BY NARA KASBERGEN (@XIEHAN) DEC 11, 2019 Dead zones
will continue to be a problem

PREDICTION BY NARA KASBERGEN (@XIEHAN) DEC 11, 2019 Platforms will
begin to move more of their NLP to devices in order to allow for more ofﬂine- ﬁrst possibilities

PREDICTION BY NARA KASBERGEN (@XIEHAN) DEC 11, 2019 The wildcard
factor: What about ofﬂine-ﬁrst consumer apps?

Trust 5

PROBLEM STATEMENT No one is going to want JARVIS in
their home and in their life if they don't trust it The fact that voice tech has been dominated by big tech companies that have a poor reputation for respecting consumer privacy has hurt this space Tech companies need to win back our trust

PREDICTION BY NARA KASBERGEN (@XIEHAN) DEC 11, 2019 There will
be more privacy scandals in 2020

Screenshot of 2019 Amazon privacy scandal

PREDICTION BY NARA KASBERGEN (@XIEHAN) DEC 11, 2019 We could
see our ﬁrst GDPR lawsuit related to voice technology in 2020

PREDICTION BY NARA KASBERGEN (@XIEHAN) DEC 11, 2019 Facebook may
launch a voice assistant in 2020

OTHER PREDICTIONS

PREDICTION BY NARA KASBERGEN (@XIEHAN) DEC 11, 2019 There will
be more small feature releases to improve monetization

PREDICTION BY NARA KASBERGEN (@XIEHAN) DEC 11, 2019 Alexa Presentation
Language (APL) will be deprecated

APL, we hardly knew you...

PREDICTION BY NARA KASBERGEN (@XIEHAN) DEC 11, 2019 Multimodal devices
will continue to be important and not important at the same time

PREDICTION BY NARA KASBERGEN (@XIEHAN) DEC 11, 2019 2020 may
determine whether Bixby has a future or not

PREDICTION BY NARA KASBERGEN (@XIEHAN) DEC 11, 2019 Microsoft will
give up on Cortana (but may not kill it completely just yet)

PREDICTION BY NARA KASBERGEN (@XIEHAN) DEC 11, 2019 Apple could
ﬁnally do something and it could change everything

PREDICTION BY NARA KASBERGEN (@XIEHAN) DEC 11, 2019 Apple could
ﬁnally do something and it could change everything Apple also may not do anything

voice tech could shift back toward native mobile app integrations

Danke Schön! ANY QUESTIONS? @xiehan on Twitter [email protected]

[VoiceCon] The Voice Technology Landscape

[VoiceCon] The Voice Technology Landscape

More Decks by Nara Kasbergen

Other Decks in Technology

Featured

Transcript