P6-11: Self-Similarity-Based and Novelty-Based Loss for Music Structure Analysis
Geoffroy Peeters (LTCI - Télécom Paris, IP Paris)*
Subjects (starting with primary): MIR fundamentals and methodology -> music signal processing ; Musical features and properties -> structure, segmentation, and form
Presented In Person: 4-minute short-format presentation
Music Structure Analysis (MSA) is the task aiming at identifying musical segments that compose a music track and possibly label them based on their similarity.
In this paper we propose a supervised approach for the task of music boundary detection. In our approach we simultaneously learn features and convolution kernels.
For this we jointly optimize
- a loss based on the Self-Similarity-Matrix (SSM) obtained with the learned features, denoted by SSM-loss, and
- a loss based on the novelty score obtained applying the learned kernels to the estimated SSM, denoted by novelty-loss.
We also demonstrate that relative feature learning, through self-attention, is beneficial for the task of MSA.
Finally, we compare the performances of our approach to previously proposed approaches on the standard RWC-Pop, and various subsets of SALAMI.
If the video does not load properly please use the direct link to video