A robust mid-level representation for harmonic content in music signals

Juan P. Bello, Jeremy Pickens

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

When considering the problem of audio-to-audio matching, determining musical similarity using low-level features such as Fourier transforms and MFCCs is an extremely difficult task, as there is little semantic information available. Full semantic transcription of audio is an unreliable and imperfect task in the best case, an unsolved problem in the worst. To this end we propose a robust mid-level representation that incorporates both harmonic and rhythmic information, without attempting full transcription. We describe a process for creating this representation automatically, directly from multi-timbral and polyphonic music signals, with an emphasis on popular music. We also offer various evaluations of our techniques. Moreso than most approaches working from raw audio, we incorporate musical knowledge into our assumptions, our models, and our processes. Our hope is that by utilizing this notion of a musically-motivated mid-level representation we may help bridge the gap between symbolic and audio research.

Original languageEnglish (US)
Title of host publicationISMIR 2005 - 6th International Conference on Music Information Retrieval
Pages304-311
Number of pages8
StatePublished - 2005
Event6th International Conference on Music Information Retrieval, ISMIR 2005 - London, United Kingdom
Duration: Sep 11 2005Sep 15 2005

Publication series

NameISMIR 2005 - 6th International Conference on Music Information Retrieval

Other

Other6th International Conference on Music Information Retrieval, ISMIR 2005
CountryUnited Kingdom
CityLondon
Period9/11/059/15/05

Keywords

  • Harmonic description
  • Music similarity
  • Segmentation

ASJC Scopus subject areas

  • Music
  • Information Systems

Fingerprint Dive into the research topics of 'A robust mid-level representation for harmonic content in music signals'. Together they form a unique fingerprint.

Cite this