P1-07: Collaborative Song Dataset (CoSoD): An Annotated Dataset of Multi-Artist Collaborations in Popular Music

Michèle Duguay (Harvard University)*, Kate Mancey (Harvard University), Johanna Devaney (Brooklyn College)

Subjects (starting with primary): Evaluation, datasets, and reproducibility -> novel datasets and use cases ; Musical features and properties -> timbre, instrumentation, and singing voice

Presented In Person: 4-minute short-format presentation

Abstract:

The Collaborative Song Dataset (CoSoD) is a corpus of 331 multi-artist collaborations from the 2010–2019 Billboard “Hot 100” year-end charts. The corpus is annotated with formal sections, aspects of vocal production (including reverberation, layering, panning, and gender of the performers), and relevant metadata. CoSoD complements other popular music datasets by focusing exclusively on musical collaborations between independent acts. In addition to facilitating the study of song form and vocal production, CoSoD allows for the in-depth study of gender as it relates to various timbral, pitch, and formal parameters in musical collaborations. In this paper, we detail the contents of the dataset and outline the annotation process. We also present an experiment using CoSoD that examines how the use of reverberation, layering, and panning are related to the gender of the artist. In this experiment, we find that men’s voices are on average treated with less reverberation and occupy a more narrow position in the stereo mix than women’s voices.

If the video does not load properly please use the direct link to video