Upgrade to PRO for Only $50/Year—Limited-Time Offer! 🔥
Speaker Deck
Features
Speaker Deck
PRO
Sign in
Sign up for free
Search
Search
Fundamentals of Music Processing (Chapter 5)
Search
Koga Kobayashi
December 12, 2019
Research
0
92
Fundamentals of Music Processing (Chapter 5)
Koga Kobayashi
December 12, 2019
Tweet
Share
More Decks by Koga Kobayashi
See All by Koga Kobayashi
第13回 Data-Centric AI勉強会, LLMのファインチューニングデータ
kajyuuen
4
1.7k
基礎数学の公式
kajyuuen
1
160
初等確率論の基礎
kajyuuen
1
180
Deep Markov Model を数式で追う (+ Pyroでの追試)
kajyuuen
0
930
完全なアノテーションが得られない状況下での固有表現抽出
kajyuuen
3
3.6k
SecHack365 北海道会 LT
kajyuuen
0
520
専門用語抽出手法の研究と 抽出アプリケーションの開発
kajyuuen
1
1.3k
Other Decks in Research
See All in Research
LLM-jp-3 and beyond: Training Large Language Models
odashi
1
710
CVPR2025論文紹介:Unboxed
murakawatakuya
0
230
情報技術の社会実装に向けた応用と課題:ニュースメディアの事例から / appmech-jsce 2025
upura
0
280
音声感情認識技術の進展と展望
nagase
0
400
説明可能な機械学習と数理最適化
kelicht
2
730
若手研究者が国際会議(例えばIROS)でワークショップを企画するメリットと成功法!
tanichu
0
120
生成AI による論文執筆サポート・ワークショップ ─ サーベイ/リサーチクエスチョン編 / Workshop on AI-Assisted Paper Writing Support: Survey/Research Question Edition
ks91
PRO
0
120
機械学習と数理最適化の融合 (MOAI) による革新
mickey_kubo
1
440
湯村研究室の紹介2025 / yumulab2025
yumulab
0
250
[IBIS 2025] 深層基盤モデルのための強化学習驚きから理論にもとづく納得へ
akifumi_wachi
18
8.6k
POI: Proof of Identity
katsyoshi
0
120
AWSで実現した大規模日本語VLM学習用データセット "MOMIJI" 構築パイプライン/buiding-momiji
studio_graph
2
1k
Featured
See All Featured
JavaScript: Past, Present, and Future - NDC Porto 2020
reverentgeek
52
5.8k
Raft: Consensus for Rubyists
vanstee
141
7.2k
XXLCSS - How to scale CSS and keep your sanity
sugarenia
249
1.3M
Large-scale JavaScript Application Architecture
addyosmani
515
110k
Code Review Best Practice
trishagee
74
19k
Reflections from 52 weeks, 52 projects
jeffersonlam
355
21k
Building an army of robots
kneath
306
46k
Creating an realtime collaboration tool: Agile Flush - .NET Oxford
marcduiker
35
2.3k
個人開発の失敗を避けるイケてる考え方 / tips for indie hackers
panda_program
122
21k
Responsive Adventures: Dirty Tricks From The Dark Corners of Front-End
smashingmag
254
22k
Dealing with People You Can't Stand - Big Design 2015
cassininazir
367
27k
[RailsConf 2023 Opening Keynote] The Magic of Rails
eileencodes
31
9.8k
Transcript
Fundamentals of Music Processing Chapter 5: Chord Recognition খྛ
ᕣՏ εϥΠυʹؚ·ΕΔਤFundamentals of Music ProcessingΑΓҾ༻
Chapter 5: Chord Recognition Chord(Ի) • 3ͭҎ্ͷҟͳΔԻූ͔Βߏ͞ΕΔԻͷ͜ͱ Harmony() • ෳͷԻ͔ΒͳΔܥྻɺԻਐߦ
FM7 G7 Em7 Am Harmony
Chapter 5: Chord Recognition Chord Recognition(Իೝࣝ) • Ի͔ΒԻਐߦΛೝࣝ͢Δٕज़ ԻָϑΝΠϧ͔ΒίʔυේΛࣗಈͰ࡞ग़དྷΔ Իೝ͕ࣝ͏·͍͘͘ͱ…
Chapter 5.3: HMM-Based Chord Recognition Chapter 5.1~5.2 • ಛྔ͔ΒԻΛ͋Δఔਪఆग़དྷΔ ͔͠͠ɺ͜ΕԻҰͭҰ͔ͭ͠ݟ͍ͯͳ͍
Α͘ग़ΔԻܨ͕Γͷڧ͍Իʹண͍ͨ͠ ྫ: I–IV–V–Iਐߦ • FGසग़͠ɺ͍͖ͳΓFmʹߦ͘͜ͱ΄΅ແ͍ HMM(ӅΕϚϧίϑϞσϧ)Λར༻ͯ͠ԻਪఆΛߦ͏
Chapter 5.3: HMM-Based Chord Recognition Ի: , ঢ়ଶ: (for )
ͱͨ͠ͱ͖ɺϚϧίϑੑΛԾఆ͢Δͱ Ͱ͋Δ֬ A := {α1 , α2 , ⋯, αI } s i ∈ [1 : I] sn+1 = αj P[sn+1 = αj |sn = αi , sn−1 = αk , ⋯] = P[sn+1 = αj |P[sn+1 = αj |sn ] ͜͜Ͱ ΛҎԼͷΑ͏ʹఆٛ͢Δɻ(for ) aij i, j ∈ [1 : I] ͜Εঢ়ଶ͕ ͔Β ʹભҠ͢Δ֬ͱߟ͑Δ͜ͱ͕ग़དྷΔ αi αj aij := P[sn+1 = αj |sn = αi ] ∈ [0,1] ·ͣɺϚϧίϑ࿈Λར༻ͨ͠Ի༧ଌʹ͍ͭͯઆ໌͢Δ
Chapter 5.3: HMM-Based Chord Recognition ۩ମྫ; , ͷͱ͖ I =
3 A := {α1 = C, α2 = G, α3 = F} ·ͨ࠷ॳͷঢ়ଶ͕ Ͱ͋Δ֬ΛҎԼͷΑ͏ʹఆٛ͢Δ αi ci := P[s1 = αi ] ∈ [0,1]
Chapter 5.3: HMM-Based Chord Recognition , , ͷͱ͖ ঢ়ଶܥྻ: ʹ͍ͭͯߟ͑Δ
I = 3 A := {α1 = C, α2 = G, α3 = F} C = (c1 , c2 , c3 )T = (0.6,0.2,0.3)T S = (C, C, C, G, G, F, F, C, C) ࠷ॳͷঢ়ଶ͕ Ͱ͋Δ֬ΛҎԼͷΑ͏ʹఆٛ͠ɺ αi ci := P[s1 = αi ] ∈ [0,1] ۩ମྫ
Chapter 5.3: HMM-Based Chord Recognition ભҠ͕֬ҎԼͷͱ͖ ͷΑ͏ͳԻਐߦ͕ى͖Δ֬ S S =
(C, C, C, G, G, F, F, C, C) = c1 ⋅ a11 ⋅ a11 ⋅ a12 ⋅ a22 ⋅ a23 ⋅ a33 ⋅ a31 ⋅ a11 ≈ 1.29 ⋅ 10−4
Chapter 5.3: HMM-Based Chord Recognition ઌఔঢ়ଶܥྻ Λ༻͍ͯɺԻਐߦͷ֬Λܭࢉ͕ͨ͠ ࣮ੈքͰऔΓ͏Δͯ͢ͷ ʹ͍ͭͯܭࢉෆՄೳ S
S ྫ: 10छྨͷԻɺ20͔ͭΒߏ͞ΕΔۂͷ߹ ύλʔϯͷ֬Λܭࢉ͢Δඞཁ͕͋Δ 1020 ͦ͜Ͱ෦ͷঢ়ଶͰͳ͘ɺಛϕΫτϧΛ༻͍ͯ ԻਐߦΛٻΊΔํ๏ͱͯ͠HMMΛར༻͢Δɻ
Chapter 5.3: HMM-Based Chord Recognition
Chapter 5.3: HMM-Based Chord Recognition
Chapter 5.3: HMM-Based Chord Recognition Input: Իσʔλ͔Β؍ଌͨ͠ܥྻ ͔Β ϞσϧΛར༻͠ɺ؍ଌܥྻ Λ࡞ɻ
O = (o1 , ⋯, oN ) B = (β1 , ⋯, βN ) ؍ଌܥྻ ؍ଌγϯϘϧ ͔Βߏ͞ΕΔ B = (β1 , ⋯, βN ) ℬ = {β1 , ⋯, βk } (for k ∈ [1 : K])
Chapter 5.3: HMM-Based Chord Recognition
Chapter 5.3: HMM-Based Chord Recognition Viterbi: ؍ଌܥྻ ͱ ॳظঢ়ଶͷ֬
ੜ֬ͱભҠ͔֬Β༗άϥϑΛ࡞ B = (β1 , ⋯, βN ) C = (c1 , c2 , c3 )T = (0.6,0.2,0.3)T ੜ֬ ભҠ֬
Chapter 5.3: HMM-Based Chord Recognition ੜ֬ ભҠ֬ ॳظঢ়ଶͷ֬ ੜ͞Εͨ༗άϥϑ
Chapter 5.3: HMM-Based Chord Recognition ViterbiΞϧΰϦζϜʹΑͬͯ ࠷Β͍͠ܦ࿏ʹ͍ͭͯܭࢉΛߦ͏
Chapter 5.3: HMM-Based Chord Recognition ੜ͞Εͨ༗άϥϑ ViterbiΞϧΰϦζϜͰ֤ҐஔͰɺ ֤ԻʹͨͲΓண͘·Ͱͷ࠷దίετͱ ͦͷલʹࢸΔ·ͰͷϙΠϯλΛ֮͑ͳ͕ΒܭࢉΛߦ͏
Chapter 5.3: HMM-Based Chord Recognition ੜ͞Εͨ༗άϥϑ ViterbiΞϧΰϦζϜͰ֤ҐஔͰɺ ֤ԻʹͨͲΓண͘·Ͱͷ࠷దίετͱ ͦͷલʹࢸΔ·ͰͷϙΠϯλΛ֮͑ͳ͕ΒܭࢉΛߦ͏
Chapter 5.3: HMM-Based Chord Recognition ੜ͞Εͨ༗άϥϑ ViterbiΞϧΰϦζϜͰ֤ҐஔͰɺ ֤ԻʹͨͲΓண͘·Ͱͷ࠷దίετͱ ͦͷલʹࢸΔ·ͰͷϙΠϯλΛ֮͑ͳ͕ΒܭࢉΛߦ͏
Chapter 5.3: HMM-Based Chord Recognition ੜ͞Εͨ༗άϥϑ ViterbiΞϧΰϦζϜͰ֤ҐஔͰɺ ֤ԻʹͨͲΓண͘·Ͱͷ࠷దίετͱ ͦͷલʹࢸΔ·ͰͷϙΠϯλΛ֮͑ͳ͕ΒܭࢉΛߦ͏
Chapter 5.3: HMM-Based Chord Recognition ੜ͞Εͨ༗άϥϑ ViterbiΞϧΰϦζϜͰ֤ҐஔͰɺ ֤ԻʹͨͲΓண͘·Ͱͷ࠷దίετͱ ͦͷલʹࢸΔ·ͰͷϙΠϯλΛ֮͑ͳ͕ΒܭࢉΛߦ͏
Chapter 5.3: HMM-Based Chord Recognition ੜ͞Εͨ༗άϥϑ ViterbiΞϧΰϦζϜͰ֤ҐஔͰɺ ֤ԻʹͨͲΓண͘·Ͱͷ࠷దίετͱ ͦͷલʹࢸΔ·ͰͷϙΠϯλΛ֮͑ͳ͕ΒܭࢉΛߦ͏
Chapter 5.3: HMM-Based Chord Recognition ࠷ޙʹɺͦͷϙΠϯλΛḷΓ࠷Β͍͠ԻਐߦΛ ٻΊΔ A := {α1
= C, α2 = G, α3 = F} ͷͱ͖ ̂ S = (C, C, C, G, G, F)