Upgrade to Pro
— share decks privately, control downloads, hide ads and more …
Speaker Deck
Features
Speaker Deck
PRO
Sign in
Sign up for free
Search
Search
機械学習のための音声信号処理(基礎編)/ Speech Signal Processing
Search
Sponsored
·
Your Podcast. Everywhere. Effortlessly.
Share. Educate. Inspire. Entertain. You do you. We'll handle the rest.
→
moonlight-aska
October 21, 2018
1.3k
0
Share
機械学習のための音声信号処理(基礎編)/ Speech Signal Processing
2018年10月21日開催の「大江橋Pythonの会#3」の資料です.
moonlight-aska
October 21, 2018
More Decks by moonlight-aska
See All by moonlight-aska
Create Your Own AI with Dify×Gemma3
aska
0
62
Generative AI Prototyping
aska
0
25
【入門】プロンプトの書き方のコツ / Tips for writing prompts
aska
0
220
CHATGPT。はじめの一歩 / ChatGPT. Get Started
aska
0
140
「Kingyo AI Navi」アプリ / Kingyo AI Navi App
aska
0
270
Kingo AI Navi LINEをもっと使い倒せ!!
aska
0
160
Depth画像で物体検知やってみたー。/ Objects Detection with Depth Images
aska
0
830
Kingyo AI Naviアプリ開発 / Kingyo AI Navi App
aska
0
450
AutoML Vision Edgeで金魚分類モデルを学習してみた / Kingyo Classification Model with AutoML Vision Edge
aska
0
590
Featured
See All Featured
Lightning Talk: Beautiful Slides for Beginners
inesmontani
PRO
2
570
Winning Ecommerce Organic Search in an AI Era - #searchnstuff2025
aleyda
1
2k
JavaScript: Past, Present, and Future - NDC Porto 2020
reverentgeek
52
6k
AI in Enterprises - Java and Open Source to the Rescue
ivargrimstad
0
1.3k
世界の人気アプリ100個を分析して見えたペイウォール設計の心得
akihiro_kokubo
PRO
71
40k
Templates, Plugins, & Blocks: Oh My! Creating the theme that thinks of everything
marktimemedia
31
2.8k
The #1 spot is gone: here's how to win anyway
tamaranovitovic
2
1.1k
Understanding Cognitive Biases in Performance Measurement
bluesmoon
32
2.9k
From Legacy to Launchpad: Building Startup-Ready Communities
dugsong
0
220
What the history of the web can teach us about the future of AI
inesmontani
PRO
1
600
Future Trends and Review - Lecture 12 - Web Technologies (1019888BNR)
signer
PRO
0
3.6k
Navigating the Design Leadership Dip - Product Design Week Design Leaders+ Conference 2024
apolaine
1
340
Transcript
None
NARA
안녕하세요
None
f P t
None
None
None
None
None
None
Convolutional Recurrent Neural Network
[音情報処理論 音声処理における信号処理1より引用]
None
None
×
×
[音声言語処理特論 第2回音声認識の基礎、DPマッチングの基礎より引用]
None
None
None
None
None
None
https://www.kaggle.com/c/tensorflow-speech-recognition-challenge
None
MFCC12 ΔMFCC12 ΔΔMFCC12
None
None
None
None
--- No. 6795 edc53350_nohash_0.wav (house ) --- * 1位 :
house (0.999820) 2位 : cat (0.000060) 3位 : off (0.000032) 4位 : yes (0.000024) 5位 : down (0.000013) --- No. 6796 e95c70e2_nohash_0.wav (house ) --- * 1位 : house (0.999953) 2位 : off (0.000012) 3位 : cat (0.000005) 4位 : eight (0.000004) 5位 : happy (0.000004) --- No. 6797 258f4559_nohash_0.wav (house ) --- * 1位 : house (0.999980) 2位 : off (0.000007) 3位 : happy (0.000004) 4位 : cat (0.000004) 5位 : eight (0.000002) --- No. 6798 1657c9fa_nohash_0.wav (house ) --- * 1位 : house (0.999972) 2位 : off (0.000011) 3位 : yes (0.000003) 4位 : happy (0.000003) 5位 : cat (0.000003) ---------- Total Accuracy ---------- 1位 : 93.57 % ( 6361 / 6798 ) 2位 : 96.78 % ( 6579 / 6798 ) 3位 : 97.78 % ( 6647 / 6798 ) 4位 : 98.35 % ( 6686 / 6798 ) 5位 : 98.57 % ( 6701 / 6798 )
None
NARA
None
None
None
https://ai.googleblog.com/2017/08/launching-speech-commands-dataset.html