Structured training for large-vocabulary chord recognition

Brian McFee, Juan Pablo Bello

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

Automatic chord recognition systems operating in the large-vocabulary regime must overcome data scarcity: certain classes occur much less frequently than others, and this presents a significant challenge when estimating model parameters. While most systems model the chord recognition task as a (multi-class) classification problem, few attempts have been made to directly exploit the intrinsic structural similarities between chord classes. In this work, we develop a deep convolutional-recurrent model for automatic chord recognition over a vocabulary of 170 classes. To exploit structural relationships between chord classes, the model is trained to produce both the time-varying chord label sequence as well as binary encodings of chord roots and qualities. This binary encoding directly exposes similarities between related classes, allowing the model to learn a more coherent representation of simultaneous pitch content. Evaluations on a corpus of 1217 annotated recordings demonstrate substantial improvements compared to previous models.

Original languageEnglish (US)
Title of host publicationProceedings of the 18th International Society for Music Information Retrieval Conference, ISMIR 2017
EditorsSally Jo Cunningham, Zhiyao Duan, Xiao Hu, Douglas Turnbull
PublisherInternational Society for Music Information Retrieval
Pages188-194
Number of pages7
ISBN (Electronic)9789811151798
StatePublished - 2017
Event18th International Society for Music Information Retrieval Conference, ISMIR 2017 - Suzhou, China
Duration: Oct 23 2017Oct 27 2017

Publication series

NameProceedings of the 18th International Society for Music Information Retrieval Conference, ISMIR 2017

Conference

Conference18th International Society for Music Information Retrieval Conference, ISMIR 2017
CountryChina
CitySuzhou
Period10/23/1710/27/17

ASJC Scopus subject areas

  • Music
  • Information Systems

Fingerprint Dive into the research topics of 'Structured training for large-vocabulary chord recognition'. Together they form a unique fingerprint.

  • Cite this

    McFee, B., & Bello, J. P. (2017). Structured training for large-vocabulary chord recognition. In S. J. Cunningham, Z. Duan, X. Hu, & D. Turnbull (Eds.), Proceedings of the 18th International Society for Music Information Retrieval Conference, ISMIR 2017 (pp. 188-194). (Proceedings of the 18th International Society for Music Information Retrieval Conference, ISMIR 2017). International Society for Music Information Retrieval.