Upgrade to Pro
— share decks privately, control downloads, hide ads and more …
Speaker Deck
Features
Speaker Deck
PRO
Sign in
Sign up for free
Search
Search
Fundamentals of Music Processing (Chapter 5)
Search
Koga Kobayashi
December 12, 2019
Research
0
62
Fundamentals of Music Processing (Chapter 5)
Koga Kobayashi
December 12, 2019
Tweet
Share
More Decks by Koga Kobayashi
See All by Koga Kobayashi
第13回 Data-Centric AI勉強会, LLMのファインチューニングデータ
kajyuuen
4
980
基礎数学の公式
kajyuuen
1
120
初等確率論の基礎
kajyuuen
1
160
Deep Markov Model を数式で追う (+ Pyroでの追試)
kajyuuen
0
850
完全なアノテーションが得られない状況下での固有表現抽出
kajyuuen
3
3.4k
SecHack365 北海道会 LT
kajyuuen
0
470
専門用語抽出手法の研究と 抽出アプリケーションの開発
kajyuuen
1
1.2k
Other Decks in Research
See All in Research
Segment Any Change
satai
2
210
eAI (Engineerable AI) プロジェクトの全体像 / Overview of eAI Project
ishikawafyu
0
360
Weekly AI Agents News! 1月号 アーカイブ
masatoto
1
160
博士学位論文予備審査 / Scaling Telemetry Workloads in Cloud Applications: Techniques for Instrumentation, Storage, and Mining
yuukit
1
1.7k
JSAI NeurIPS 2024 参加報告会(AI アライメント)
akifumi_wachi
5
810
Data-centric AI勉強会 「ロボットにおけるData-centric AI」
haraduka
0
430
言語モデルLUKEを経済の知識に特化させたモデル「UBKE-LUKE」について
petter0201
0
190
ラムダ計算の拡張に基づく 音楽プログラミング言語mimium とそのVMの実装
tomoyanonymous
0
390
Poster: Feasibility of Runtime-Neutral Wasm Instrumentation for Edge-Cloud Workload Handover
chikuwait
0
330
Weekly AI Agents News! 11月号 プロダクト/ニュースのアーカイブ
masatoto
0
290
20241115都市交通決起集会 趣旨説明・熊本事例紹介
trafficbrain
0
980
AWS 音声基盤モデル トーク解析AI MiiTelの音声処理について
ken57
0
130
Featured
See All Featured
Git: the NoSQL Database
bkeepers
PRO
427
64k
Agile that works and the tools we love
rasmusluckow
328
21k
Art, The Web, and Tiny UX
lynnandtonic
298
20k
The Invisible Side of Design
smashingmag
299
50k
Visualization
eitanlees
146
15k
Documentation Writing (for coders)
carmenintech
67
4.6k
Keith and Marios Guide to Fast Websites
keithpitt
411
22k
Dealing with People You Can't Stand - Big Design 2015
cassininazir
366
25k
[Rails World 2023 - Day 1 Closing Keynote] - The Magic of Rails
eileencodes
33
2.1k
Intergalactic Javascript Robots from Outer Space
tanoku
270
27k
Music & Morning Musume
bryan
46
6.3k
Into the Great Unknown - MozCon
thekraken
35
1.6k
Transcript
Fundamentals of Music Processing Chapter 5: Chord Recognition খྛ
ᕣՏ εϥΠυʹؚ·ΕΔਤFundamentals of Music ProcessingΑΓҾ༻
Chapter 5: Chord Recognition Chord(Ի) • 3ͭҎ্ͷҟͳΔԻූ͔Βߏ͞ΕΔԻͷ͜ͱ Harmony() • ෳͷԻ͔ΒͳΔܥྻɺԻਐߦ
FM7 G7 Em7 Am Harmony
Chapter 5: Chord Recognition Chord Recognition(Իೝࣝ) • Ի͔ΒԻਐߦΛೝࣝ͢Δٕज़ ԻָϑΝΠϧ͔ΒίʔυේΛࣗಈͰ࡞ग़དྷΔ Իೝ͕ࣝ͏·͍͘͘ͱ…
Chapter 5.3: HMM-Based Chord Recognition Chapter 5.1~5.2 • ಛྔ͔ΒԻΛ͋Δఔਪఆग़དྷΔ ͔͠͠ɺ͜ΕԻҰͭҰ͔ͭ͠ݟ͍ͯͳ͍
Α͘ग़ΔԻܨ͕Γͷڧ͍Իʹண͍ͨ͠ ྫ: I–IV–V–Iਐߦ • FGසग़͠ɺ͍͖ͳΓFmʹߦ͘͜ͱ΄΅ແ͍ HMM(ӅΕϚϧίϑϞσϧ)Λར༻ͯ͠ԻਪఆΛߦ͏
Chapter 5.3: HMM-Based Chord Recognition Ի: , ঢ়ଶ: (for )
ͱͨ͠ͱ͖ɺϚϧίϑੑΛԾఆ͢Δͱ Ͱ͋Δ֬ A := {α1 , α2 , ⋯, αI } s i ∈ [1 : I] sn+1 = αj P[sn+1 = αj |sn = αi , sn−1 = αk , ⋯] = P[sn+1 = αj |P[sn+1 = αj |sn ] ͜͜Ͱ ΛҎԼͷΑ͏ʹఆٛ͢Δɻ(for ) aij i, j ∈ [1 : I] ͜Εঢ়ଶ͕ ͔Β ʹભҠ͢Δ֬ͱߟ͑Δ͜ͱ͕ग़དྷΔ αi αj aij := P[sn+1 = αj |sn = αi ] ∈ [0,1] ·ͣɺϚϧίϑ࿈Λར༻ͨ͠Ի༧ଌʹ͍ͭͯઆ໌͢Δ
Chapter 5.3: HMM-Based Chord Recognition ۩ମྫ; , ͷͱ͖ I =
3 A := {α1 = C, α2 = G, α3 = F} ·ͨ࠷ॳͷঢ়ଶ͕ Ͱ͋Δ֬ΛҎԼͷΑ͏ʹఆٛ͢Δ αi ci := P[s1 = αi ] ∈ [0,1]
Chapter 5.3: HMM-Based Chord Recognition , , ͷͱ͖ ঢ়ଶܥྻ: ʹ͍ͭͯߟ͑Δ
I = 3 A := {α1 = C, α2 = G, α3 = F} C = (c1 , c2 , c3 )T = (0.6,0.2,0.3)T S = (C, C, C, G, G, F, F, C, C) ࠷ॳͷঢ়ଶ͕ Ͱ͋Δ֬ΛҎԼͷΑ͏ʹఆٛ͠ɺ αi ci := P[s1 = αi ] ∈ [0,1] ۩ମྫ
Chapter 5.3: HMM-Based Chord Recognition ભҠ͕֬ҎԼͷͱ͖ ͷΑ͏ͳԻਐߦ͕ى͖Δ֬ S S =
(C, C, C, G, G, F, F, C, C) = c1 ⋅ a11 ⋅ a11 ⋅ a12 ⋅ a22 ⋅ a23 ⋅ a33 ⋅ a31 ⋅ a11 ≈ 1.29 ⋅ 10−4
Chapter 5.3: HMM-Based Chord Recognition ઌఔঢ়ଶܥྻ Λ༻͍ͯɺԻਐߦͷ֬Λܭࢉ͕ͨ͠ ࣮ੈքͰऔΓ͏Δͯ͢ͷ ʹ͍ͭͯܭࢉෆՄೳ S
S ྫ: 10छྨͷԻɺ20͔ͭΒߏ͞ΕΔۂͷ߹ ύλʔϯͷ֬Λܭࢉ͢Δඞཁ͕͋Δ 1020 ͦ͜Ͱ෦ͷঢ়ଶͰͳ͘ɺಛϕΫτϧΛ༻͍ͯ ԻਐߦΛٻΊΔํ๏ͱͯ͠HMMΛར༻͢Δɻ
Chapter 5.3: HMM-Based Chord Recognition
Chapter 5.3: HMM-Based Chord Recognition
Chapter 5.3: HMM-Based Chord Recognition Input: Իσʔλ͔Β؍ଌͨ͠ܥྻ ͔Β ϞσϧΛར༻͠ɺ؍ଌܥྻ Λ࡞ɻ
O = (o1 , ⋯, oN ) B = (β1 , ⋯, βN ) ؍ଌܥྻ ؍ଌγϯϘϧ ͔Βߏ͞ΕΔ B = (β1 , ⋯, βN ) ℬ = {β1 , ⋯, βk } (for k ∈ [1 : K])
Chapter 5.3: HMM-Based Chord Recognition
Chapter 5.3: HMM-Based Chord Recognition Viterbi: ؍ଌܥྻ ͱ ॳظঢ়ଶͷ֬
ੜ֬ͱભҠ͔֬Β༗άϥϑΛ࡞ B = (β1 , ⋯, βN ) C = (c1 , c2 , c3 )T = (0.6,0.2,0.3)T ੜ֬ ભҠ֬
Chapter 5.3: HMM-Based Chord Recognition ੜ֬ ભҠ֬ ॳظঢ়ଶͷ֬ ੜ͞Εͨ༗άϥϑ
Chapter 5.3: HMM-Based Chord Recognition ViterbiΞϧΰϦζϜʹΑͬͯ ࠷Β͍͠ܦ࿏ʹ͍ͭͯܭࢉΛߦ͏
Chapter 5.3: HMM-Based Chord Recognition ੜ͞Εͨ༗άϥϑ ViterbiΞϧΰϦζϜͰ֤ҐஔͰɺ ֤ԻʹͨͲΓண͘·Ͱͷ࠷దίετͱ ͦͷલʹࢸΔ·ͰͷϙΠϯλΛ֮͑ͳ͕ΒܭࢉΛߦ͏
Chapter 5.3: HMM-Based Chord Recognition ੜ͞Εͨ༗άϥϑ ViterbiΞϧΰϦζϜͰ֤ҐஔͰɺ ֤ԻʹͨͲΓண͘·Ͱͷ࠷దίετͱ ͦͷલʹࢸΔ·ͰͷϙΠϯλΛ֮͑ͳ͕ΒܭࢉΛߦ͏
Chapter 5.3: HMM-Based Chord Recognition ੜ͞Εͨ༗άϥϑ ViterbiΞϧΰϦζϜͰ֤ҐஔͰɺ ֤ԻʹͨͲΓண͘·Ͱͷ࠷దίετͱ ͦͷલʹࢸΔ·ͰͷϙΠϯλΛ֮͑ͳ͕ΒܭࢉΛߦ͏
Chapter 5.3: HMM-Based Chord Recognition ੜ͞Εͨ༗άϥϑ ViterbiΞϧΰϦζϜͰ֤ҐஔͰɺ ֤ԻʹͨͲΓண͘·Ͱͷ࠷దίετͱ ͦͷલʹࢸΔ·ͰͷϙΠϯλΛ֮͑ͳ͕ΒܭࢉΛߦ͏
Chapter 5.3: HMM-Based Chord Recognition ੜ͞Εͨ༗άϥϑ ViterbiΞϧΰϦζϜͰ֤ҐஔͰɺ ֤ԻʹͨͲΓண͘·Ͱͷ࠷దίετͱ ͦͷલʹࢸΔ·ͰͷϙΠϯλΛ֮͑ͳ͕ΒܭࢉΛߦ͏
Chapter 5.3: HMM-Based Chord Recognition ੜ͞Εͨ༗άϥϑ ViterbiΞϧΰϦζϜͰ֤ҐஔͰɺ ֤ԻʹͨͲΓண͘·Ͱͷ࠷దίετͱ ͦͷલʹࢸΔ·ͰͷϙΠϯλΛ֮͑ͳ͕ΒܭࢉΛߦ͏
Chapter 5.3: HMM-Based Chord Recognition ࠷ޙʹɺͦͷϙΠϯλΛḷΓ࠷Β͍͠ԻਐߦΛ ٻΊΔ A := {α1
= C, α2 = G, α3 = F} ͷͱ͖ ̂ S = (C, C, C, G, G, F)