TY - GEN
T1 - Sambaset
T2 - 20th International Society for Music Information Retrieval Conference, ISMIR 2019
AU - Maia, Lucas S.
AU - Fuentes, Magdalena
AU - Biscainho, Luiz W.P.
AU - Rocamora, Martín
AU - Essid, Slim
N1 - Funding Information:
Authors would like to thank CNPq and CAPES-ANII-CNRS (STIC AmSud program) for funding this work.
Publisher Copyright:
© 2020 International Society for Music Information Retrieval. All rights reserved.
PY - 2019
Y1 - 2019
N2 - In the last few years, several datasets have been released to meet the requirements of "hungry" yet promising datadriven approaches in music technology research. Since, for historical reasons, most investigations conducted in the field still revolve around music of the so-called "Western" tradition, the corresponding data, methodology and conclusions carry a strong cultural bias. Music of non- "Western" background, whenever present, is usually underrepresented, poorly labeled, or even mislabeled, the exception being projects that aim at specifically describing such music. In this paper we present SAMBASET, a dataset of Brazilian samba music that contains over 40 hours of historical and modern samba de enredo commercial recordings. To the best of our knowledge, this is the first dataset of this genre. We describe the collection of metadata (e.g. artist, composer, release date) and outline our semiautomatic approach to the challenging task of annotating beats in this large dataset, which includes the assessment of the performance of state-of-the-art beat tracking algorithms for this specific case. Finally, we present a study on tempo and beat tracking that illustrates SAMBASET's value, and we comment on other tasks for which it could be used.
AB - In the last few years, several datasets have been released to meet the requirements of "hungry" yet promising datadriven approaches in music technology research. Since, for historical reasons, most investigations conducted in the field still revolve around music of the so-called "Western" tradition, the corresponding data, methodology and conclusions carry a strong cultural bias. Music of non- "Western" background, whenever present, is usually underrepresented, poorly labeled, or even mislabeled, the exception being projects that aim at specifically describing such music. In this paper we present SAMBASET, a dataset of Brazilian samba music that contains over 40 hours of historical and modern samba de enredo commercial recordings. To the best of our knowledge, this is the first dataset of this genre. We describe the collection of metadata (e.g. artist, composer, release date) and outline our semiautomatic approach to the challenging task of annotating beats in this large dataset, which includes the assessment of the performance of state-of-the-art beat tracking algorithms for this specific case. Finally, we present a study on tempo and beat tracking that illustrates SAMBASET's value, and we comment on other tasks for which it could be used.
UR - http://www.scopus.com/inward/record.url?scp=85087096223&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85087096223&partnerID=8YFLogxK
M3 - Conference contribution
AN - SCOPUS:85087096223
T3 - Proceedings of the 20th International Society for Music Information Retrieval Conference, ISMIR 2019
SP - 628
EP - 635
BT - Proceedings of the 20th International Society for Music Information Retrieval Conference, ISMIR 2019
A2 - Flexer, Arthur
A2 - Peeters, Geoffroy
A2 - Urbano, Julian
A2 - Volk, Anja
PB - International Society for Music Information Retrieval
Y2 - 4 November 2019 through 8 November 2019
ER -