In conversation with a browser - at SydJS

IN CONVERSATION WITH A BROWSER @philnash

Phil Nash @philnash http://philna.sh [email protected]

BOTS @philnash

ELIZA: Please tell me what's been bothering you. YOU: I'm
scared of public speaking. ELIZA: Do you know anyone else who is scared of public speaking ? YOU: Lot's of people ELIZA: Do you feel strongly about discussing such things ? YOU: Only with you ELIZA: You're not really talking about me -- are you ? YOU: No, I'm the one who is scared. ELIZA: Why 'no' ? @philnash

@philnash

*´¯`*.¸¸.*´¯`*ÃŇĞẸĹŜĎẸÃĎƑÃĹĹ*´¯`*.¸¸.*´¯`* @philnash

IRC > SMS > SLACK @philnash

HOW DO WE BUILD OUR OWN? @philnash

WEB APIS @philnash

WEB SPEECH @philnash

Speech Synthesis Hello everyone const utterance = new SpeechSynthesisUtterance(text); speechSynthesis.speak(utterance);
01. 02. @philnash

Speech Synthesis https://glitch.com/~browser-voices @philnash

@philnash

Speech Recognition Start Speech Recognition start listening to me and
probably putting things into that textarea there eventually I can get it going again I can see he @philnash

Speech Recognition const recognition = new webkitSpeechRecognition(); recognition.addEventListener('result', event =>
{ const result = event.results[0][0].transcript; console.log(result); }); recognition.start(); 01. 02. 03. 04. 05. 06. @philnash

@philnash

Speech Recognition Sends all the data to Google Cloud Speech
API @philnash

MEDIARECORDER API @philnash

MediaRecorder API Start recording 0:00 0:00 @philnash

MediaRecorder API const stream = await navigator.mediaDevices.getUserMedia(); const recorder =
new MediaRecorder(stream, { type: 'audio/webm' }); const chunks = []; 01. 02. 03. @philnash

MediaRecorder API recorder.addEventListener('dataavailable', event => { if (typeof event.data ===
'undefined') return; if (event.data.size === 0) return; chunks.push(event.data); }); 01. 02. 03. 04. 05. @philnash

MediaRecorder API recorder.addEventListener('stop', event => { const recording = new
Blob(chunks, { type: 'audio/webm' }); }); 01. 02. 03. 04. 05. @philnash

@philnash

MediaRecorder API https://glitch.com/~web-recorder @philnash

Then what? Send the ﬁle to a speech to text
service • Google Cloud Speech • Azure Cognitive Services • IBM Watson @philnash

WEBAUDIO API @philnash

@philnash

AUDIOWORKLET + WEBSOCKETS @philnash

DEMO @philnash

Web Speech Alternatives/Polyﬁlls https://github.com/watson-developer-cloud/speech-javascript-sdk https://github.com/anteloe/speech-polyﬁll https://github.com/compulim/web-speech-cognitive-services @philnash

THIS IS ALL GREAT... BUT @philnash

IT'S SENDING ALL THE MIC DATA TO A THIRD PARTY
SERVICE @philnash

WAKE WORDS @philnash

MACHINE LEARNING @philnash

Machine Learning TensorFlow.js ml5.js @philnash

DEMO @philnash

CONVERSATION DESIGN @philnash

SPEAK YOUR BOT CONVERSATIONS OUT LOUD WITH SOMEONE ELSE @philnash

WHAT DO WE DO WITH THIS? @philnash

TECHNICAL JOURNEY @philnash

WEB PLATFORM @philnash

EXPERIMENTATION + FREEDOM @philnash

@philnash

WEB ASSISTANT @philnash

Web Assistant https://github.com/philnash/web-assistant/ @philnash

THIS IS JUST THE START OF THE JOURNEY @philnash

@philnash

Thanks! @philnash http://philna.sh [email protected]

In conversation with a browser - at SydJS

In conversation with a browser - at SydJS

More Decks by Phil Nash

Other Decks in Programming

Featured

Transcript