Upgrade to Pro — share decks privately, control downloads, hide ads and more …

国際会議ISMIR2022報告(山本分)

Yuya Yamamoto
February 27, 2023

 国際会議ISMIR2022報告(山本分)

2023年2月27日に行われた音楽情報科学研究会の発表資料です.

Yuya Yamamoto

February 27, 2023
Tweet

More Decks by Yuya Yamamoto

Other Decks in Science

Transcript

  1. I S M I R ͷ Α ͔ ͬ ͨ

    ͱ ͜ Ζ 3 બ 2 0 2 3 . 0 2 . 2 7 S I G M U S 1 3 6 I S M I R 2 0 2 2 ใ ࠂ ʙ ͍ ͪ ࢀ Ճ ऀ ͷ ໨ ઢ ͔ Β ʙ ɹ ɹ ɹ ஜ ೾ େ ֶ ࢁ ຊ ༤ ໵ 2022.12.08 Ұॹͷϗςϧͷϝϯόʔͨͪͱɽ
  2. ࣗ ݾ ঺ հ • ࢁຊ ༤໵ (΍·΋ͱ Ώ͏΍) •

    ஜ೾େֶ ਓͱԻͷ৘ใֶݚڀࣨʢࣉᖒݚʣ ത࢜ޙظ՝ఔ2೥ • (2022. 08-12: ؖࠃ KAIST ๚໰ݚڀһ) • J-POPͷ”ՎএςΫχοΫ”Λ൐૗͖ͭͷՎ ͔Βࣗಈݕग़͢Δٕज़ʹ͍ͭͯݱ஍Ͱൃද • Analysis and Detection of Singing Techniques in Repertoires of J-POP Solo Singers (ࢁຊ, Nam, ࣉᖒ) • Juhan Nam (KAISTʣͱͷڞಉݚڀ C U LT U R A L D I V E R S I T Y ʹ ৐ ͬͯ ண ෺ Ͱ ൃ ද ʂ
  3. I S M I R ͷ ͍ ͍ ͱ ͜

    Ζ ᶃ Ի ָ ৘ ใ ॲ ཧ ͷ ࠷ ઌ ୺ Λ ஌ Ε Δ
  4. ٞ ࿦ Λ ଅ ͢ ۭ ؒ ͕ ੔ ͑Β

    Εͯ ͍ Δ I S M I R ͷ ͍ ͍ ͱ ͜ Ζ ᶄ
  5. ൃ ද ͷ Α ͏͢ ޱ಄ൃද x 4෼ ϙελʔൃද x

    1࣌ؒ (ݱ஍ͷΈ) օ͕͋Δఔ౓಺༰Λ஌ͬͨঢ়ଶͰདྷͯ͘ΕΔ+ ΈͬͪΓٞ࿦Ͱ͖Δʂ
  6. ԕ ִ ͷ ࢀ Ճ ऀ ͱ ͷ Π ϯ

    λ ϥ Ϋ γ ϣ ϯ ֶձଆͰఏڙ͍ͯ͠ΔSlackͰΈͬͪΓٞ࿦ ԕִࢀՃऀͱͷϏσΦνϟοτ΋ൃੜʂ
  7. ੈ ք த ͷ ݚ ڀ ऀ ͱ ͭ ͳ

    ͕ Γ ͕ Ͱ ͖ Δ I S M I R ͷ ͍ ͍ ͱ ͜ Ζ ᶅ
  8. I S M I R ͱ ͍ ͏ ֶ ձ

    ͷ ಛ ௃ • ʢൺֱతʣগਓ਺ • γϯάϧτϥοΫ • ࢀՃऀಉ࢜ͷͭͳ͕ΓΛੵۃతʹଅ͍ͯ͠Δ • ΈΜͳԻָʹؔ͢ΔݚڀΛ͍ͯ͠Δʢˡେࣄʣ
  9. ݱ ஍ ͳ ΒͰ ͸ ͷ ͜ ͱ όϯέοτ΍ίʔώʔλΠϜͰͷஊসλΠϜ Իָؔ࿈ͷϓϩάϥϜ

    ੈքதͷMIRݚڀऀͨͪͱɼԻָΛͭ·Έʹݚڀ΍ ͦΕͧΕͷจԽɾੜ׆ʹ͍ͭͯஊসͰ͖Δ
  10. ݸ ਓ త ʹ ࢥ ͏ ࠃ ࡍ ձ ٞ

    ʢ ର ໘ ʣ ͷ ͏ · Έ • ࠷ઌ୺ͷݚڀΛ௚ʹ৮Εͯ஌Δ͜ͱ͕Ͱ͖Δ • ΈͬͪΓٞ࿦Ͱ͖Δ • ੈքதͷݚڀऀͱͷग़ձ͍͕͋Δ • ҟͳΔ஍ͰҟͳΔจԽʹ৮ΕΔ͜ͱ͕Ͱ͖Δ • ISMIR͸ͦͷ͏·ΈΛ࠷େԽͨ͠ࠃࡍձٞʂ
  11. ͓ · ͚ ɿ ࢀ Ճ ࣌ ʹ΍ ͬͯΑ ͔

    ͬ ͨ ͜ ͱ 5 બ • શͯͷ࿦จʹͬ͟ͱ໨Λ௨͢ • ࣭໰Λͱʹ͔͘౤͔͚͛ΔʢSlackͰ΋ϙελʔͰ΋ʣ • ੈքͷ஍ཧ΍จԽʹৄ͘͠ͳ͓͖ͬͯɼ࿩ͷωλΛ࡞Δ • ண෺ΛணΔࣗ෼ͷࠃͰ࿩ͤΔωλΛ͓࣋ͬͯ͘ • λϒϨοτ୺຤Λങ͏ʢϙελʔൃදͰ͸ύιίϯΑΓ ॏๅʣ
  12. ӳ ޠ ؔ ࿈ Ͱ ݸ ਓ త ʹ΍ ͬͯΑ

    ͔ ͬ ͨ ͜ ͱ • ӳޠ͕ۤख͔ͩΒͱҤॖ͗͢͠ͳ͍ • ͱʹ͔͘ϦεχϯάΛຏ͘ • पΓ͸౰ͨΓલ͕ͩωΠςΟϒεϐʔυ • ൃԻʹ͍ͭͯ஌Δͷʹ͓͢͢Ί-> https://youtu.be/nEpewJsEgUg • ֤ࠃͷᨅΓͷบ΋஌͓ͬͯ͘ͱ͍͍͔΋ • ࣌ʹ͸σδλϧΛۦ࢖ • ୯ޠ͕ࢥ͍ු͔͹ͳ͍࣌“Ah~ What should I say… (खݩͰg◦ogle຋༁)” Ͱ৐Γ੾ͬͨ৔໘͕͋ͬͨ
  13. ΄ ͔ ҹ ৅ త ͩ ͬ ͨ ͜ ͱ

    • ؖࠃ੎ͷ୆಄ • ࠓճ20ਓ͕ۙ͘ࢀՃɽKorean-SMIR dinner͕2ճ։࠵ ʢࢁຊ͸KAISTͷΑ͠ΈͰ2ճ໨ʹ͓अຐͨ͠ʣ • एखͷΞΫςΟϒ͞ • ֶੜͰ΋ੵۃతʹslack౳ʹϙετ • ศ৐ͨ͠ˠ
  14. ͓ · ͚ ɿ Π ϯ υ ͷ ͜ ΅

    Ε ࿩ ᶃ • ΧϨʔͷछྨ͕๛෋ • ຖճ10छྨ͸ඞͣ͋ͬͨɽϨύʔτϦʔ΋๛෋ɽΧϨʔ޷͖ͳࢲʹ͸ఱࠃͩͬͨɽ • ຖճUberͷར༻͕େม… • ͳ͔ͳ͔͔ͭ·Βͳ͍ • ੕4ϗςϧΛࣗྗͰݟ͚͕ͭͨɼγϟτϧόε͕ఏڙ͞Ε͍ͯΔެࣜఏڙͷϗςϧʹധ ·ͬͨ΄͏͕Α͔͔ͬͨ΋ • ෲ௧ʹ஫ҙʂ • ಉߦϝϯόʔ6ਓத5ਓෲ௧ʹ…
  15. ͓ · ͚ ɿ Π ϯ υ ͷ ͜ ΅

    Ε ࿩ ᶄ • Χʔυ͕࢖͑ͳ͍৔ॴଟ͠ʂ • ؖࠃ͔Βͷ౉ߤͰɼݱۚ͸KRW͔࣋ͬͯ͠ͳ͔ͬͨʢͦͷ্KRW྆ସෆՄʣɽɹɹɹɹɹɹɹ ۭߓͰ൧΋৯͑ͣ1్࣌ؒํʹ฻Ε͍ͯͨ…ʢޙͰATMΛݟ͚ͭͳΜͱ͔ͳΔʣ • ࢸΔॴʹ໺ݘ͕ʂ • ೔ຊͷോ͘Β͍਺͕͍Δײ֮. ҙ֎ʹ΋ਓջ͍ͬ͜ • ձ৔ۙ͘Ͱ͸໺ੜͷݘͱࣛʁ͕݌՞͢Δϋϓχϯά΋ • ंͷΫϥΫγϣϯͷԻ৭ͷଟ༷͞ • ं͸΢ΠϯΧʔͷ୅ΘΓʹΫϥΫγϣϯΛ࢖͏จԽɽָثΈ͍ͨͰ֗த͕·ΔͰύϨʔυ
  16. ҹ ৅ ʹ ࢒ ͬ ͨ ݚ ڀ ᶃ ɿ

    ଟ ໘ త ͳ ࣝ ผ • Իָͷ΋ͭෳ਺ͷཁૉʹண໨͠ɼෳ਺ͷಛ௃Λ࢖͏ • End-to-End Lyrics Transcription Informed by Pitch and Onset Estimation ՎࢺࣝผʹϐονͱΦϯηοτΛ࢖͏ • And what if two musical versions don't share melody, harmony, rhythm, or lyrics? ΧόʔιϯάࣝผʹϐονɼϋʔϞχʔɼϦζ ϜɼՎࢺಛ௃Λ૊Έ߹ΘͤɼSoTA ʢଟ༷ͳόʔδϣϯҧ͍ʹదԠʣ • Verse versus Chorus: Structure-aware Feature Extraction for Lyrics-based Genre Recognition ՎࢺϕʔεͷδϟϯϧࣝผʹʮͲ ͷηΫγϣϯͷՎࢺ͔ʁʯͷӨڹΛௐࠪ
  17. • సҠֶश • Melody transcription via generative pre-training ϝϩσΟͷָේΛग़ྗɼJukebox΍MT3ͷembeddingΛར༻ •

    Singing beat tracking with Self-supervised front-end and linear transformers Վ”ͷΈ”Λೖྗͱ͢ΔϏʔτਪఆɽWavLM΍ DistillHuBERTΛೖྗʹར༻ • Transfer Learning of wav2vec 2.0 for Automatic Lyric Transcription Wav2Vec2.0Λ༻͍ͨՎࢺࣝผɼগͳֶ͍शσʔλͰ΋େྔσʔ λͰֶशͤͨ͞Ϟσϧʹඖఢ͢ΔੑೳΛ࣮ݱ • σʔλ֦ு • PDAugment: Data Augmentation by Pitch and Duration Adjustments for Automatic Lyrics Transcription େྔͷ࿩੠Λ”ϝϩσΟ Խ”ֶ͠शσʔλʹՃ͑ΔՎࢺࣝผ • Scaling Polyphonic Transcription with Mixtures of Monophonic Transcriptions ෳ਺ͷ୯Իͷԋ૗σʔλΛॏͶ߹ΘͤͨσʔλΛֶ शͨ͠ෳ਺ָث࠾ේ • ίετߟྀܕֶश • Toward postprocessing-free neural networks for joint beat and downbeat estimation ޮ཰తͳ৞ΈࠐΈϞδϡʔϧʢSeparable convʣͱଛࣦؔ਺ʢFocal loss+DICE lossʣΛར༻ͨ͠ޙॲཧ͍ΒͣͷϏʔτਪఆ • Analysis and Detection of Singing Techniques in Repertoires of J-POP Solo Singers (ࣗ෼ͷ) Focal lossΛར༻͠୹͍ՎএςΫχο Ϋͷݕग़ੑೳUp ҹ ৅ ʹ ࢒ ͬ ͨ ݚ ڀ ᶄ ɿ σ ʔ λ ෆ ଍ ΁ ͷ ର Ԡ
  18. • Traces of Globalization in Online Music Consumption Patterns and

    Results of Recommendation Algorithms ԻָਪનγεςϜ ͕ԻָͷGlobalizationΛଅ͔͢Ͳ͏͔Λௐ΂ͨ -> USԻָ͕Ϛʔ έοτΛಠ઎͍ͯ͠Δࠃ͕ଟ͘ɼNNϕʔεͷਪનγεςϜ͕ GlobalizationΛଅ͕͢USԻָͷಠ઎ੑ΋ॿ௕͢Δ͜ͱΛࣔࠦ • Violin Etudes: A Comprehensive Dataset for f0 Estimation and Performance Analysis όΠΦϦϯԋ૗ͷͨΊͷf0ͷΞϊςʔγϣ ϯσʔληοτ CREPE୯ମͰ͸ਖ਼͘͠ਪఆͰ͖ͳ͍͜ͱΛ֬ೝ ͠ɼஞ࣍తʹf0Λਪఆ->ֶशʹՃ͑Δͱ͍͏ϧʔϓͰσʔληο τΛߏங ҹ ৅ ʹ ࢒ ͬ ͨ ݚ ڀ ᶅ ɿ ͦ ͷ ଞ ॾ ʑ