Slide 4
MIDI-level token representations have been actively explored
in music generation.
• Music Transformer [CZ Huang et al., 2019]
  • MIDI-like representation × Transformer
  • Generates long and coherent music
• Pop Music Transformer [YS Huang et al., 2020]
  • REMI representation × Transformer-XL
  • Generates beat-aligned music
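To make the difference between the two representations concrete, here is a minimal sketch of how the same three notes might be tokenized in each style. The token names, the `Note` type, and the 16-steps-per-bar grid are illustrative assumptions, not the exact vocabularies of the cited papers; velocity, tempo, and chord tokens are left out for brevity.

```python
# Illustrative sketch only: token names and the 16th-note grid are assumptions,
# not the exact vocabularies used by Music Transformer or Pop Music Transformer.
from typing import List, NamedTuple


class Note(NamedTuple):
    start: int     # onset, in 16th-note steps from the start of the piece
    duration: int  # length, in 16th-note steps
    pitch: int     # MIDI pitch number (60 = middle C)


STEPS_PER_BAR = 16  # assumes 4/4 time on a 16th-note grid


def to_midi_like(notes: List[Note]) -> List[str]:
    """Note-on / note-off / time-shift events, in the spirit of MIDI-like tokens."""
    events = []
    for n in notes:
        events.append((n.start, 1, n.pitch, "NOTE_ON"))
        events.append((n.start + n.duration, 0, n.pitch, "NOTE_OFF"))
    events.sort()  # by time; note-offs sort before note-ons at the same time
    tokens, now = [], 0
    for time, _, pitch, kind in events:
        if time > now:
            # Elapsed time is encoded relative to the previous event.
            tokens.append(f"TIME_SHIFT_{time - now}")
            now = time
        tokens.append(f"{kind}_{pitch}")
    return tokens


def to_remi_like(notes: List[Note]) -> List[str]:
    """Bar / position / pitch / duration tokens, in the spirit of REMI."""
    tokens, current_bar = [], -1
    for n in sorted(notes):
        bar, pos = divmod(n.start, STEPS_PER_BAR)
        while current_bar < bar:
            tokens.append("BAR")  # explicit bar line gives the model a metrical grid
            current_bar += 1
        tokens.append(f"POSITION_{pos}")
        tokens.append(f"PITCH_{n.pitch}")
        tokens.append(f"DURATION_{n.duration}")
    return tokens


notes = [Note(0, 4, 60), Note(4, 4, 64), Note(16, 8, 67)]
midi_like_tokens = to_midi_like(notes)
remi_like_tokens = to_remi_like(notes)
```

In this sketch the MIDI-like stream encodes time only as shifts between events, while the REMI-like stream places each note at an explicit position inside a bar, which is the property associated with beat-aligned output.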
MIDI × Transformer
Transformers (with relative positional encoding) work well with MIDI-level token representations.
What about score-level token representations?