Pixnet hackthon - workshop

洞洞⾒見見未來來 ∽ၻᇹ৘8PSLTIPQ Blue Chen ([email protected]) 2018/07/17 @
Pixnet

硬體介紹 Google Voice Kit 介紹

較不建議 Deep Learning Framework Community 彈性多元語⾳音助理理建議使⽤用 Model B 版本

雙麥克風擴充槽

裝 Button 找顆 LED 燈裝進去裝上 Speaker

1 2 3 4 5 6

安裝 RPI •WINDOWS 版本   https://www.botsheet.com/cht/raspberry-pi-tutorial-install-raspbian-windows/ •MAC 版本   https://www.botsheet.com/cht/raspberry-pi-tutorial-install-raspbian-windows/

詳情：https://github.com/shivasiddharth/GassistPi 安裝 Mic Firmware 先在 /home/pi 下 git clone https://github.com/shivasiddharth/GassistPi.git
若若喇喇叭有聲⾳音即是成功

語⾳音助理理介紹架構

10 Voice Assistant - Core Process voice command One Mic
process Local Speech Recognition Wake up Biometrics Speech to text yes: react no: stop Natural Language Understanding Application Wakeup step Voice command step Short commend Sound monitoring Pixnet Hackthon focus

11 Voice Assistant - Core Process voice command One Mic
process Local Speech Recognition Wake up Biometrics Speech to text yes: react no: stop Natural Language Understanding Application Wakeup step Voice command step Short commend Sound monitoring 今⽇日課程 2：接 Bing API 今⽇日課程 1：客製化喚醒詞

Ａ B C C B A 可以即時分離現場所有⼈人聲

講 5 次，迅速客製化喚醒詞今⽇日課程 1: 客製化⾃自⼰己的喚醒詞⼩小秘訣：我們錄⾳音不受限⼀一定要安靜環境，若若機器使⽤用環境處在⾼高吵雜噪⾳音下，使⽤用這樣的錄⾳音⾳音檔能在這類環境有更更好的效果。

今⽇日課程 1: 客製化⾃自⼰己的喚醒詞若若現成的開燈跟關燈不容易易喚醒，可以再次錄⾳音若若要在 Command line 中啟動：可以輸入 python
vad1.py on 啟動錄⾳音, on 可改為 off (關燈) tmp (客製化指令) 記得進入訓練模式 python train-tmp.py 1 2

今⽇日課程 1: 客製化⾃自⼰己的喚醒詞啟動應⽤用程式若若要在 Command line 中啟動，可以輸入 python tmp.py
0.65 2  0.65 為模型靈敏度，0-1 越⾼高越容易易被喚醒  2 為麥克風靈敏度， 0-2 越⾼高越靈敏

今⽇日課程 1: 客製化⾃自⼰己的喚醒詞 python tmp.py 0.65 2 Output 出結果，後續可以基於這個去串串接接下的程式

https://azure.microsoft.com/zh-tw/services/cognitive-services/directory/ 今⽇日課程 2: Speech To Text 1 2 點擊這個先申請免費帳⼾戶
串串接微軟的 Bing 服務

今⽇日課程 2: Speech To Text Copy 此串串⽂文字

今⽇日課程 2: Speech To Text 安裝⼀一下 Node.js 環境安裝 NVM
wget -qO- https://raw.githubusercontent.com/creationix/nvm/v0.33.11/install.sh | bash source ~/.bashrc ⽤用 NVM 安裝 Node.js nvm install node.js 套件地址 https://github.com/noopkat/ms-bing-speech-service 安裝 npm install ms-bing-speech-service 安裝套件

今⽇日課程 2: Speech To Text 中⽂文： zh-CN 貼上剛剛的 key

今⽇日課程 1 + 2 整合以 Node.js 為例例將 Output
後的規則去作後續識別

Demo KKBOX ⾳音樂串串流機

經驗分享幾個製作 NN 模型需注意的地⽅方

NN Layer Output Input 以聲⾳音識別為例例轉換轉換聲⾳音前處理理挑的套件要顧及開發平台 1. 不建議在
Google AIY kit ⽤用 librosa 2. 建議在 Google AIY kit 使⽤用 C base 的前處理理套件

以聲⾳音輸出為例例 NN Layer Output Input 轉換轉換⾳音檔採樣是常⾒見見問題： 1. Linux
⾳音源底層⽤用 alsa, 官⽅方 RPI 版本的效能低弱 2. 8K , 16K, 48K 採樣影響 Input 維度，前處理理後的標準值不同 3. 模型不可重覆使⽤用機率⾼高前處理理後處理理將維度轉回真實可聽的聲⾳音很花時間

26 www.relajet.com

Pixnet hackthon - workshop

Pixnet hackthon - workshop

blue chen

More Decks by blue chen

Other Decks in Education

Featured

Transcript

洞洞⾒見見未來來 ∽ၻᇹ৘8PSLTIPQ Blue Chen ([email protected]) 2018/07/17 @

硬體介紹 Google Voice Kit 介紹

較不建議 Deep Learning Framework Community 彈性多元語⾳音助理理建議使⽤用 Model B 版本

雙麥克風擴充槽

裝 Button 找顆 LED 燈裝進去裝上 Speaker

1 2 3 4 5 6

安裝 RPI •WINDOWS 版本   https://www.botsheet.com/cht/raspberry-pi-tutorial-install-raspbian-windows/ •MAC 版本   https://www.botsheet.com/cht/raspberry-pi-tutorial-install-raspbian-windows/

詳情：https://github.com/shivasiddharth/GassistPi 安裝 Mic Firmware 先在 /home/pi 下 git clone https://github.com/shivasiddharth/GassistPi.git

語⾳音助理理介紹架構

10 Voice Assistant - Core Process voice command One Mic

11 Voice Assistant - Core Process voice command One Mic

Ａ B C C B A 可以即時分離現場所有⼈人聲

講 5 次，迅速客製化喚醒詞今⽇日課程 1: 客製化⾃自⼰己的喚醒詞⼩小秘訣：我們錄⾳音不受限⼀一定要安靜環境，若若機器使⽤用環境處在⾼高吵雜噪⾳音下，使⽤用這樣的錄⾳音⾳音檔能在這類環境有更更好的效果。

今⽇日課程 1: 客製化⾃自⼰己的喚醒詞若若現成的開燈跟關燈不容易易喚醒，可以再次錄⾳音若若要在 Command line 中啟動：可以輸入 python

今⽇日課程 1: 客製化⾃自⼰己的喚醒詞啟動應⽤用程式若若要在 Command line 中啟動，可以輸入 python tmp.py

今⽇日課程 1: 客製化⾃自⼰己的喚醒詞 python tmp.py 0.65 2 Output 出結果，後續可以基於這個去串串接接下的程式

https://azure.microsoft.com/zh-tw/services/cognitive-services/directory/ 今⽇日課程 2: Speech To Text 1 2 點擊這個先申請免費帳⼾戶

今⽇日課程 2: Speech To Text Copy 此串串⽂文字

今⽇日課程 2: Speech To Text 安裝⼀一下 Node.js 環境安裝 NVM

今⽇日課程 2: Speech To Text 中⽂文： zh-CN 貼上剛剛的 key

今⽇日課程 1 + 2 整合以 Node.js 為例例將 Output

Demo KKBOX ⾳音樂串串流機

經驗分享幾個製作 NN 模型需注意的地⽅方

NN Layer Output Input 以聲⾳音識別為例例轉換轉換聲⾳音前處理理挑的套件要顧及開發平台 1. 不建議在

以聲⾳音輸出為例例 NN Layer Output Input 轉換轉換⾳音檔採樣是常⾒見見問題： 1. Linux

26 www.relajet.com