Voice assistants have taken off, but can we build our own with web technologies? I've been building bots for other platforms, and I wanted to investigate how well a voice assistant could work in the browser. Can we talk to a web application and get results?

Let's dive into the Web Speech API, speech synthesis, and conversation design. We'll find out whether browsers can be virtual assistants or virtually useless.

Browser voices:

An introduction to the MediaRecorder API:
Web recorder:

Speech to text with Watson in the browser:

TensorFlow speech model:


