Slide 33
Slide 33 text
Research Group for
Geometric Optimization
and Machine Learning
Experiments Summary I
Data
GTZAN dataset comprising 1000 songs with 30 sec. length,
equally divided into 10 genres: blues, classical,
country, disco, hiphop, jazz, metal, pop, reggae,
and rock.
One of the most frequently used datasets in MGR.
But: Exposes several problems such as replications,
mislabelings, and distortions. [6]
Features
CQT
Spectrogram
Features are normalized
Slide 23/36 | Interdisciplinary Project | Music Genre Recognition using Dictionary Learning | July 2013