In conversation with a browser

IN CONVERSATION WITH A BROWSER @philnash

Phil Nash @philnash http://philna.sh [email protected]

BOTS @philnash

ELIZA: How do you do. Please tell me your problem.
YOU: I've had too much beer and pizza ELIZA: You say you've had too much beer and pizza ? YOU: yes ELIZA: I understand. YOU: It's a problem ELIZA: What does that suggest to you ? @philnash

@philnash

*´¯`*.¸¸.*´¯`*ÃŇĞẸĹŜĎẸÃĎƑÃĹĹ*´¯`*.¸¸.*´¯`* @philnash

IRC > SMS > SLACK @philnash

HOW DO WE BUILD OUR OWN? @philnash

WEB APIS @philnash

WEB SPEECH @philnash

Speech Synthesis const utterance = new SpeechSynthesisUtterance(text); speechSynthesis.speak(utterance); 01. 02.
@philnash

Speech Synthesis https://glitch.com/~browser-voices @philnash

@philnash

Speech Recognition Start Speech Recognition leaving Mel JS how was
all the pizza is really useful at because it just works in again it's reefs @philnash

Speech Recognition const recognition = new webkitSpeechRecognition(); recognition.addEventListener('result', event =>
{ const result = event.results[0][0].transcript; console.log(result); }); recognition.start(); 01. 02. 03. 04. 05. 06. @philnash

@philnash

Speech Recognition Sends all the data to Google Cloud Speech
API @philnash

MEDIARECORDER API @philnash

MediaRecorder API Start recording 0:02 0:02 / 0:02 / 0:02
@philnash

MediaRecorder API const stream = await navigator.mediaDevices.getUserMedia(); const recorder =
new MediaRecorder(stream, { type: 'audio/webm' }); const chunks = []; 01. 02. 03. @philnash

MediaRecorder API recorder.addEventListener('dataavailable', event => { if (typeof event.data ===
'undefined') return; if (event.data.size === 0) return; chunks.push(event.data); }); 01. 02. 03. 04. 05. @philnash

MediaRecorder API recorder.addEventListener('stop', event => { const recording = new
Blob(chunks, { type: 'audio/webm' }); }); 01. 02. 03. 04. 05. @philnash

@philnash

MediaRecorder API https://glitch.com/~web-recorder @philnash

Then what? Send the ﬁle to a speech to text
service • Google Cloud Speech • Azure Cognitive Services • IBM Watson @philnash

WEBAUDIO API @philnash

@philnash

AUDIOWORKLET + WEBSOCKETS @philnash

DEMO @philnash

Web Speech Alternatives/Polyﬁlls https://github.com/watson-developer-cloud/speech-javascript-sdk https://github.com/anteloe/speech-polyﬁll https://github.com/compulim/web-speech-cognitive-services @philnash

THIS IS ALL GREAT... BUT @philnash

IT'S SENDING ALL THE MIC DATA TO A THIRD PARTY
SERVICE @philnash

WAKE WORDS @philnash

MACHINE LEARNING @philnash

Machine Learning TensorFlow.js ml5.js @philnash

DEMO @philnash

CONVERSATION DESIGN @philnash

SPEAK YOUR BOT CONVERSATIONS OUT LOUD WITH SOMEONE ELSE @philnash

WHAT DO WE DO WITH THIS? @philnash

TECHNICAL JOURNEY @philnash

WEB PLATFORM @philnash

EXPERIMENTATION + FREEDOM @philnash

@philnash

WEB ASSISTANT @philnash

Web Assistant https://github.com/philnash/web-assistant/ @philnash

THIS IS JUST THE START OF THE JOURNEY @philnash

@philnash

Thanks! @philnash http://philna.sh [email protected]

In conversation with a browser

In conversation with a browser

More Decks by Phil Nash

Other Decks in Programming

Featured

Transcript