Web Speech API

Web Speech APIs

$WHOAMI I am Pascal Helfenstein nicam

Why speech?

What do we need?

Speech Recognition Speech Synthesis “It can talk” “It can hear”
Some Intelligence “It can think”

The browser can do it!

How does it speak? var msg = new SpeechSynthesisUtterance('Hello World');
// Config (optional) msg.voice = 'Google UK English Male'; msg.volume = 1; // 0 <-> 1 msg.rate = 1.5; // 0 <-> 1 msg.pitch = 1; // 0 <-> 2 speechSynthesis.speak(msg);

How does it listen? var recognition = new webkitSpeechRecognition(); //
Config (optional) recognition.lang = "en-US"; // "de-DE", "en-IN" recognition.continuous = true; recognition.interimResults = true; recognition.addEventListener('result', function(event) { console.log(event.results[0][0].transcript); // "whats the time" console.log(event.results[0][0].confidence); // 0.93414807319641 }); recognition.addEventListener('end', function(event) { // Start your action }); recognition.start();

How does it think? \_(ツ)_/¯

Ask \_(ツ)_/¯

WebSockets API Calls Demo architecture Speech Recognition Speech Synthesis

Browser Support

What to look out for • interactivity requires SSL… •
synthesising long texts breaks it • it’s not good in noisy environments

Questions?

Thanks!

Web Speech API

Web Speech API

Pascal Helfenstein

More Decks by Pascal Helfenstein

Other Decks in Programming

Featured

Transcript

Web Speech APIs

$WHOAMI I am Pascal Helfenstein nicam

Why speech?

What do we need?

Speech Recognition Speech Synthesis “It can talk” “It can hear”

The browser can do it!

How does it speak? var msg = new SpeechSynthesisUtterance('Hello World');

How does it listen? var recognition = new webkitSpeechRecognition(); //

How does it think? \_(ツ)_/¯

Ask \_(ツ)_/¯

DEMO

WebSockets API Calls Demo architecture Speech Recognition Speech Synthesis

Browser Support

What to look out for • interactivity requires SSL… •

Questions?

Thanks!