A Crowdsourced Experiment for Tempo Estimation of Electronic Dance Music

INTERNATIONAL AUDIO LABORATORIES ERLANGEN A joint institution of Fraunhofer IIS
and Universität Erlangen-Nürnberg A Crowdsourced Experiment for Tempo Estimation of Electronic Dance Music Hendrik Schreiber  tagtraum industries incorporated [email protected] / @h_schreiber Meinard Müller  International AudioLabs Erlangen [email protected]

and Universität Erlangen-Nürnberg EDM Tempo Estimation Piece of Cake?

and Universität Erlangen-Nürnberg GiantSteps Tempo The only EDM Dataset • Released by Knees et al. in 2015 • 664 Beatport previews (2min) • Created by scraping a forum • Used for benchmarking Knees, Peter, et al. "Two Data Sets for Tempo Estimation and Key Detection in Electronic Dance Music Annotated from User Corrections." ISMIR. 2015.

and Universität Erlangen-Nürnberg Results GiantSteps 0% 25% 50% 75% 100% 90.2% Cross DJ Reported Accuracy2 results

and Universität Erlangen-Nürnberg ACM Mirum Gtzan ISMIR Songs GiantSteps 0% 25% 50% 75% 100% 90.2% 93.3% 95% 97.6% Böck et al. Cross DJ Results Reported Accuracy2 results Böck, Sebastian, Florian Krebs, and Gerhard Widmer. "Accurate Tempo Estimation Based on Recurrent Neural Networks and Resonating Comb Filters." ISMIR. 2015.

and Universität Erlangen-Nürnberg ACM Mirum Gtzan ISMIR Songs GiantSteps 0% 25% 50% 75% 100% 90.2% 93.3% 95% 97.6% Böck et al. Cross DJ Results Reported Accuracy2 results ? Böck, Sebastian, Florian Krebs, and Gerhard Widmer. "Accurate Tempo Estimation Based on Recurrent Neural Networks and Resonating Comb Filters." ISMIR. 2015.

and Universität Erlangen-Nürnberg Why is the tempo estimation accuracy so low?

and Universität Erlangen-Nürnberg Experiment • 266 Participants tapped along to half-overlapping 30s segments of GiantSteps tracks • We collected tapping data • 18,684 segment submissions, ~28/track

and Universität Erlangen-Nürnberg Results • Three tracks have no real beat

and Universität Erlangen-Nürnberg 50 100 150 200 0 0.2 0.4 0.6 0.8 BPM salience segments 1-3 segment 4 segments 5-7 Figure 5: Tempo salience distributions for segments of the track ‘Rude Boy feat. Omar LinX Union Vocal Mix’ by Zeds Dead (Beatport id 1728723). The track’s tempo changes in segment 4, leading to four distinct peaks. With JSD = 0.44 its Jensen-Shannon divergence is high. 0 0.2 0.4 0.6 0.8 0 10 20 30 µJSD µJSD + 2 JSD JSD tracks in % Figure 6: Distribution of tracks in the dataset per terval with a bin width of 0.05. The blue line sho and the red line shows µJSD + 2 JSD. Genre A(Tseg) A(Ttrack ) all 0.25 0.26 Tapped Tempo Distribution Results • Three tracks have no real beat • Many tracks exhibit perceptual tempo ambiguities (genre- dependent) Peaks related by factor 2

and Universität Erlangen-Nürnberg 50 100 150 200 0 0.2 0.4 0.6 0.8 BPM salience segments 1-3 segment 4 segments 5-7 Figure 5: Tempo salience distributions for segments of the track ‘Rude Boy feat. Omar LinX Union Vocal Mix’ by Zeds Dead (Beatport id 1728723). The track’s tempo changes in segment 4, leading to four distinct peaks. With JSD = 0.44 its Jensen-Shannon divergence is high. 0 0.2 0.4 0.6 0.8 0 10 20 30 µJSD µJSD + 2 JSD JSD tracks in % Figure 6: Distribution of tracks in the dataset per terval with a bin width of 0.05. The blue line sho and the red line shows µJSD + 2 JSD. Genre A(Tseg) A(Ttrack ) all 0.25 0.26 Tapped Tempo Distribution Results • Three tracks have no real beat • Many tracks exhibit perceptual tempo ambiguities (genre- dependent) • Some tracks contain tempo changes/no global tempo Tempo1 Tempo2

and Universität Erlangen-Nürnberg Results Accuracy1 Accuracy2 0% 25% 50% 75% 100% 91.1% 81.5% Original ground-truth vs. newly derived ground-truth 18.5% 8.1%

and Universität Erlangen-Nürnberg Results Accuracy1 Accuracy2 0% 25% 50% 75% 100% 91.1% 81.5% Original ground-truth vs. newly derived ground-truth 18.5% 8.1% Large disagreement

and Universität Erlangen-Nürnberg Results Accuracy1 Accuracy2 0% 25% 50% 75% 100% 91.1% 81.5% Original ground-truth vs. newly derived ground-truth 18.5% 8.1% Large disagreement Only partially explained by octave error

and Universität Erlangen-Nürnberg Results Accuracy1 for two estimation systems:  original ground-truth vs. new ground truth 0% 25% 50% 75% 100% 70.2% 64.8% 63.1% 58.9% Original New 5.9% 7.1% böck et al. schreiber Böck, Sebastian, Florian Krebs, and Gerhard Widmer. "Accurate Tempo Estimation Based on Recurrent Neural Networks and Resonating Comb Filters." ISMIR. 2015. Schreiber, Hendrik, and M. Müller. "A post-processing procedure for improving music tempo estimates using supervised learning." ISMIR, 2017.

and Universität Erlangen-Nürnberg Results Accuracy1 for two estimation systems:  original ground-truth vs. new ground truth 0% 25% 50% 75% 100% 70.2% 64.8% 63.1% 58.9% Original New 5.9% 7.1% böck et al. schreiber Two different systems perform much better on new ground-truth Böck, Sebastian, Florian Krebs, and Gerhard Widmer. "Accurate Tempo Estimation Based on Recurrent Neural Networks and Resonating Comb Filters." ISMIR. 2015. Schreiber, Hendrik, and M. Müller. "A post-processing procedure for improving music tempo estimates using supervised learning." ISMIR, 2017.

and Universität Erlangen-Nürnberg Results Accuracy2 for two estimation systems:  original ground-truth vs. new ground truth 0% 25% 50% 75% 100% 95.2% 94% 88.7% 86.4% Original New 7.6% 6.5% böck et al. schreiber Böck, Sebastian, Florian Krebs, and Gerhard Widmer. "Accurate Tempo Estimation Based on Recurrent Neural Networks and Resonating Comb Filters." ISMIR. 2015. Schreiber, Hendrik, and M. Müller. "A post-processing procedure for improving music tempo estimates using supervised learning." ISMIR, 2017.

and Universität Erlangen-Nürnberg Results Accuracy2 for two estimation systems:  original ground-truth vs. new ground truth 0% 25% 50% 75% 100% 95.2% 94% 88.7% 86.4% Original New 7.6% 6.5% böck et al. schreiber ! Böck, Sebastian, Florian Krebs, and Gerhard Widmer. "Accurate Tempo Estimation Based on Recurrent Neural Networks and Resonating Comb Filters." ISMIR. 2015. Schreiber, Hendrik, and M. Müller. "A post-processing procedure for improving music tempo estimates using supervised learning." ISMIR, 2017.

and Universität Erlangen-Nürnberg Conclusions • Some tracks are not suitable for global tempo estimation

and Universität Erlangen-Nürnberg Conclusions • Some tracks are not suitable for global tempo estimation • Considerable number of bad annotations

and Universität Erlangen-Nürnberg Conclusions • Some tracks are not suitable for global tempo estimation • Considerable number of bad annotations • Accuracy of State-Of-The-Art systems is higher than previously thought

and Universität Erlangen-Nürnberg Conclusions • Some tracks are not suitable for global tempo estimation • Considerable number of bad annotations • Accuracy of State-Of-The-Art systems is higher than previously thought • We need to improve our datasets!  (tempo distributions, not single tempo annotations)

and Universität Erlangen-Nürnberg Thank you. All data and annotations are available at: http://www.tagtraum.com/tempo_estimation.html

A Crowdsourced Experiment for Tempo Estimation ...

A Crowdsourced Experiment for Tempo Estimation of Electronic Dance Music

Hendrik Schreiber

More Decks by Hendrik Schreiber

Other Decks in Science

Featured

Transcript

INTERNATIONAL AUDIO LABORATORIES ERLANGEN A joint institution of Fraunhofer IIS

INTERNATIONAL AUDIO LABORATORIES ERLANGEN A joint institution of Fraunhofer IIS

INTERNATIONAL AUDIO LABORATORIES ERLANGEN A joint institution of Fraunhofer IIS

INTERNATIONAL AUDIO LABORATORIES ERLANGEN A joint institution of Fraunhofer IIS

INTERNATIONAL AUDIO LABORATORIES ERLANGEN A joint institution of Fraunhofer IIS

INTERNATIONAL AUDIO LABORATORIES ERLANGEN A joint institution of Fraunhofer IIS

INTERNATIONAL AUDIO LABORATORIES ERLANGEN A joint institution of Fraunhofer IIS

INTERNATIONAL AUDIO LABORATORIES ERLANGEN A joint institution of Fraunhofer IIS

INTERNATIONAL AUDIO LABORATORIES ERLANGEN A joint institution of Fraunhofer IIS

INTERNATIONAL AUDIO LABORATORIES ERLANGEN A joint institution of Fraunhofer IIS

INTERNATIONAL AUDIO LABORATORIES ERLANGEN A joint institution of Fraunhofer IIS

INTERNATIONAL AUDIO LABORATORIES ERLANGEN A joint institution of Fraunhofer IIS

INTERNATIONAL AUDIO LABORATORIES ERLANGEN A joint institution of Fraunhofer IIS

INTERNATIONAL AUDIO LABORATORIES ERLANGEN A joint institution of Fraunhofer IIS

INTERNATIONAL AUDIO LABORATORIES ERLANGEN A joint institution of Fraunhofer IIS

INTERNATIONAL AUDIO LABORATORIES ERLANGEN A joint institution of Fraunhofer IIS

INTERNATIONAL AUDIO LABORATORIES ERLANGEN A joint institution of Fraunhofer IIS

INTERNATIONAL AUDIO LABORATORIES ERLANGEN A joint institution of Fraunhofer IIS

INTERNATIONAL AUDIO LABORATORIES ERLANGEN A joint institution of Fraunhofer IIS

INTERNATIONAL AUDIO LABORATORIES ERLANGEN A joint institution of Fraunhofer IIS

INTERNATIONAL AUDIO LABORATORIES ERLANGEN A joint institution of Fraunhofer IIS

INTERNATIONAL AUDIO LABORATORIES ERLANGEN A joint institution of Fraunhofer IIS

INTERNATIONAL AUDIO LABORATORIES ERLANGEN A joint institution of Fraunhofer IIS