Upgrade to Pro
— share decks privately, control downloads, hide ads and more …
Speaker Deck
Features
Speaker Deck
PRO
Sign in
Sign up for free
Search
Search
Fundamentals of Music Processing (Chapter 5)
Search
Koga Kobayashi
December 12, 2019
Research
110
0
Share
Fundamentals of Music Processing (Chapter 5)
Koga Kobayashi
December 12, 2019
More Decks by Koga Kobayashi
See All by Koga Kobayashi
第13回 Data-Centric AI勉強会, LLMのファインチューニングデータ
kajyuuen
4
1.8k
基礎数学の公式
kajyuuen
1
170
初等確率論の基礎
kajyuuen
1
190
Deep Markov Model を数式で追う (+ Pyroでの追試)
kajyuuen
0
950
完全なアノテーションが得られない状況下での固有表現抽出
kajyuuen
3
3.7k
SecHack365 北海道会 LT
kajyuuen
0
540
専門用語抽出手法の研究と 抽出アプリケーションの開発
kajyuuen
1
1.3k
Other Decks in Research
See All in Research
AIを叩き台として、 「検証」から「共創」へと進化するリサーチ
mela_dayo
0
250
ウェブ・ソーシャルメディア論文読み会 第36回: The Stepwise Deception: Simulating the Evolution from True News to Fake News with LLM Agents (EMNLP, 2025)
hkefka385
0
220
データサイエンティストの業務変化
datascientistsociety
PRO
0
390
LLM Compute Infrastructure Overview
karakurist
2
1.3k
多様なデータを許容し学習し続ける模倣学習 / Advanced Imitation Learning for VLA
prinlab
0
180
[チュートリアル] 電波マップ構築入門 :研究動向と課題設定の勘所
k_sato
0
410
LINEヤフー データサイエンス Meetup「三井物産コモディティ予測チャレンジ」の舞台裏-AlpacaTechパート
gamella
1
490
非試合日の野球場を楽しむためのARホームランボールキャッチ体験システムの開発 / EC79-miyazaki
yumulab
0
170
20年前に50代だった人たちの今
hysmrk
0
190
ブレグマン距離最小化に基づくリース表現量推定:バイアス除去学習の統一理論
masakat0
0
250
業界横断 副業コンプライアンス調査 三者(副業者・本業先・発注者)におけるトラブル認知ギャップの構造分析
fkske
0
1.3k
typst の使い方:言語学を研究する学生のために
gitomochang
0
400
Featured
See All Featured
The Director’s Chair: Orchestrating AI for Truly Effective Learning
tmiket
1
160
A designer walks into a library…
pauljervisheath
211
24k
Future Trends and Review - Lecture 12 - Web Technologies (1019888BNR)
signer
PRO
0
3.5k
Making Projects Easy
brettharned
120
6.6k
Self-Hosted WebAssembly Runtime for Runtime-Neutral Checkpoint/Restore in Edge–Cloud Continuum
chikuwait
0
510
Data-driven link building: lessons from a $708K investment (BrightonSEO talk)
szymonslowik
1
1k
Fantastic passwords and where to find them - at NoRuKo
philnash
52
3.7k
Save Time (by Creating Custom Rails Generators)
garrettdimon
PRO
32
3k
SEO for Brand Visibility & Recognition
aleyda
0
4.5k
Automating Front-end Workflow
addyosmani
1370
200k
Avoiding the “Bad Training, Faster” Trap in the Age of AI
tmiket
0
140
B2B Lead Gen: Tactics, Traps & Triumph
marketingsoph
0
110
Transcript
Fundamentals of Music Processing Chapter 5: Chord Recognition খྛ
ᕣՏ εϥΠυʹؚ·ΕΔਤFundamentals of Music ProcessingΑΓҾ༻
Chapter 5: Chord Recognition Chord(Ի) • 3ͭҎ্ͷҟͳΔԻූ͔Βߏ͞ΕΔԻͷ͜ͱ Harmony() • ෳͷԻ͔ΒͳΔܥྻɺԻਐߦ
FM7 G7 Em7 Am Harmony
Chapter 5: Chord Recognition Chord Recognition(Իೝࣝ) • Ի͔ΒԻਐߦΛೝࣝ͢Δٕज़ ԻָϑΝΠϧ͔ΒίʔυේΛࣗಈͰ࡞ग़དྷΔ Իೝ͕ࣝ͏·͍͘͘ͱ…
Chapter 5.3: HMM-Based Chord Recognition Chapter 5.1~5.2 • ಛྔ͔ΒԻΛ͋Δఔਪఆग़དྷΔ ͔͠͠ɺ͜ΕԻҰͭҰ͔ͭ͠ݟ͍ͯͳ͍
Α͘ग़ΔԻܨ͕Γͷڧ͍Իʹண͍ͨ͠ ྫ: I–IV–V–Iਐߦ • FGසग़͠ɺ͍͖ͳΓFmʹߦ͘͜ͱ΄΅ແ͍ HMM(ӅΕϚϧίϑϞσϧ)Λར༻ͯ͠ԻਪఆΛߦ͏
Chapter 5.3: HMM-Based Chord Recognition Ի: , ঢ়ଶ: (for )
ͱͨ͠ͱ͖ɺϚϧίϑੑΛԾఆ͢Δͱ Ͱ͋Δ֬ A := {α1 , α2 , ⋯, αI } s i ∈ [1 : I] sn+1 = αj P[sn+1 = αj |sn = αi , sn−1 = αk , ⋯] = P[sn+1 = αj |P[sn+1 = αj |sn ] ͜͜Ͱ ΛҎԼͷΑ͏ʹఆٛ͢Δɻ(for ) aij i, j ∈ [1 : I] ͜Εঢ়ଶ͕ ͔Β ʹભҠ͢Δ֬ͱߟ͑Δ͜ͱ͕ग़དྷΔ αi αj aij := P[sn+1 = αj |sn = αi ] ∈ [0,1] ·ͣɺϚϧίϑ࿈Λར༻ͨ͠Ի༧ଌʹ͍ͭͯઆ໌͢Δ
Chapter 5.3: HMM-Based Chord Recognition ۩ମྫ; , ͷͱ͖ I =
3 A := {α1 = C, α2 = G, α3 = F} ·ͨ࠷ॳͷঢ়ଶ͕ Ͱ͋Δ֬ΛҎԼͷΑ͏ʹఆٛ͢Δ αi ci := P[s1 = αi ] ∈ [0,1]
Chapter 5.3: HMM-Based Chord Recognition , , ͷͱ͖ ঢ়ଶܥྻ: ʹ͍ͭͯߟ͑Δ
I = 3 A := {α1 = C, α2 = G, α3 = F} C = (c1 , c2 , c3 )T = (0.6,0.2,0.3)T S = (C, C, C, G, G, F, F, C, C) ࠷ॳͷঢ়ଶ͕ Ͱ͋Δ֬ΛҎԼͷΑ͏ʹఆٛ͠ɺ αi ci := P[s1 = αi ] ∈ [0,1] ۩ମྫ
Chapter 5.3: HMM-Based Chord Recognition ભҠ͕֬ҎԼͷͱ͖ ͷΑ͏ͳԻਐߦ͕ى͖Δ֬ S S =
(C, C, C, G, G, F, F, C, C) = c1 ⋅ a11 ⋅ a11 ⋅ a12 ⋅ a22 ⋅ a23 ⋅ a33 ⋅ a31 ⋅ a11 ≈ 1.29 ⋅ 10−4
Chapter 5.3: HMM-Based Chord Recognition ઌఔঢ়ଶܥྻ Λ༻͍ͯɺԻਐߦͷ֬Λܭࢉ͕ͨ͠ ࣮ੈքͰऔΓ͏Δͯ͢ͷ ʹ͍ͭͯܭࢉෆՄೳ S
S ྫ: 10छྨͷԻɺ20͔ͭΒߏ͞ΕΔۂͷ߹ ύλʔϯͷ֬Λܭࢉ͢Δඞཁ͕͋Δ 1020 ͦ͜Ͱ෦ͷঢ়ଶͰͳ͘ɺಛϕΫτϧΛ༻͍ͯ ԻਐߦΛٻΊΔํ๏ͱͯ͠HMMΛར༻͢Δɻ
Chapter 5.3: HMM-Based Chord Recognition
Chapter 5.3: HMM-Based Chord Recognition
Chapter 5.3: HMM-Based Chord Recognition Input: Իσʔλ͔Β؍ଌͨ͠ܥྻ ͔Β ϞσϧΛར༻͠ɺ؍ଌܥྻ Λ࡞ɻ
O = (o1 , ⋯, oN ) B = (β1 , ⋯, βN ) ؍ଌܥྻ ؍ଌγϯϘϧ ͔Βߏ͞ΕΔ B = (β1 , ⋯, βN ) ℬ = {β1 , ⋯, βk } (for k ∈ [1 : K])
Chapter 5.3: HMM-Based Chord Recognition
Chapter 5.3: HMM-Based Chord Recognition Viterbi: ؍ଌܥྻ ͱ ॳظঢ়ଶͷ֬
ੜ֬ͱભҠ͔֬Β༗άϥϑΛ࡞ B = (β1 , ⋯, βN ) C = (c1 , c2 , c3 )T = (0.6,0.2,0.3)T ੜ֬ ભҠ֬
Chapter 5.3: HMM-Based Chord Recognition ੜ֬ ભҠ֬ ॳظঢ়ଶͷ֬ ੜ͞Εͨ༗άϥϑ
Chapter 5.3: HMM-Based Chord Recognition ViterbiΞϧΰϦζϜʹΑͬͯ ࠷Β͍͠ܦ࿏ʹ͍ͭͯܭࢉΛߦ͏
Chapter 5.3: HMM-Based Chord Recognition ੜ͞Εͨ༗άϥϑ ViterbiΞϧΰϦζϜͰ֤ҐஔͰɺ ֤ԻʹͨͲΓண͘·Ͱͷ࠷దίετͱ ͦͷલʹࢸΔ·ͰͷϙΠϯλΛ֮͑ͳ͕ΒܭࢉΛߦ͏
Chapter 5.3: HMM-Based Chord Recognition ੜ͞Εͨ༗άϥϑ ViterbiΞϧΰϦζϜͰ֤ҐஔͰɺ ֤ԻʹͨͲΓண͘·Ͱͷ࠷దίετͱ ͦͷલʹࢸΔ·ͰͷϙΠϯλΛ֮͑ͳ͕ΒܭࢉΛߦ͏
Chapter 5.3: HMM-Based Chord Recognition ੜ͞Εͨ༗άϥϑ ViterbiΞϧΰϦζϜͰ֤ҐஔͰɺ ֤ԻʹͨͲΓண͘·Ͱͷ࠷దίετͱ ͦͷલʹࢸΔ·ͰͷϙΠϯλΛ֮͑ͳ͕ΒܭࢉΛߦ͏
Chapter 5.3: HMM-Based Chord Recognition ੜ͞Εͨ༗άϥϑ ViterbiΞϧΰϦζϜͰ֤ҐஔͰɺ ֤ԻʹͨͲΓண͘·Ͱͷ࠷దίετͱ ͦͷલʹࢸΔ·ͰͷϙΠϯλΛ֮͑ͳ͕ΒܭࢉΛߦ͏
Chapter 5.3: HMM-Based Chord Recognition ੜ͞Εͨ༗άϥϑ ViterbiΞϧΰϦζϜͰ֤ҐஔͰɺ ֤ԻʹͨͲΓண͘·Ͱͷ࠷దίετͱ ͦͷલʹࢸΔ·ͰͷϙΠϯλΛ֮͑ͳ͕ΒܭࢉΛߦ͏
Chapter 5.3: HMM-Based Chord Recognition ੜ͞Εͨ༗άϥϑ ViterbiΞϧΰϦζϜͰ֤ҐஔͰɺ ֤ԻʹͨͲΓண͘·Ͱͷ࠷దίετͱ ͦͷલʹࢸΔ·ͰͷϙΠϯλΛ֮͑ͳ͕ΒܭࢉΛߦ͏
Chapter 5.3: HMM-Based Chord Recognition ࠷ޙʹɺͦͷϙΠϯλΛḷΓ࠷Β͍͠ԻਐߦΛ ٻΊΔ A := {α1
= C, α2 = G, α3 = F} ͷͱ͖ ̂ S = (C, C, C, G, G, F)