Deep salience representations for F0 estimation in polyphonic music

Rachel M. Bittner, Brian McFee, Justin Salamon, Peter Li, Juan P. Bello

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

Abstract

Estimating fundamental frequencies in polyphonic music remains a notoriously difficult task in Music Information Retrieval. While other tasks, such as beat tracking and chord recognition, have seen improvement with the application of deep learning models, little work has been done to apply deep learning methods to fundamental-frequency-related tasks, including multi-f0 and melody tracking, primarily due to the scarce availability of labeled data. In this work, we describe a fully convolutional neural network for learning salience representations for estimating fundamental frequencies, trained using a large, semi-automatically generated f0 dataset. We demonstrate the effectiveness of our model for learning salience representations for both multi-f0 and melody tracking in polyphonic audio, and show that our models achieve state-of-the-art performance on several multi-f0 and melody datasets. We conclude with directions for future research.

Original language: English (US)
Title of host publication: Proceedings of the 18th International Society for Music Information Retrieval Conference, ISMIR 2017
Editors: Sally Jo Cunningham, Zhiyao Duan, Xiao Hu, Douglas Turnbull
Publisher: International Society for Music Information Retrieval
Pages: 63-70
Number of pages: 8
ISBN (Electronic): 9789811151798
State: Published - 2017
Event: 18th International Society for Music Information Retrieval Conference, ISMIR 2017 - Suzhou, China
Duration: Oct 23 2017 to Oct 27 2017

Publication series

Name: Proceedings of the 18th International Society for Music Information Retrieval Conference, ISMIR 2017

Conference

Conference: 18th International Society for Music Information Retrieval Conference, ISMIR 2017
Country/Territory: China
City: Suzhou
Period: 10/23/17 to 10/27/17

ASJC Scopus subject areas

  • Music
  • Information Systems
