TY - GEN
T1 - A robust mid-level representation for harmonic content in music signals
AU - Bello, Juan P.
AU - Pickens, Jeremy
PY - 2005
Y1 - 2005
N2 - When considering the problem of audio-to-audio matching, determining musical similarity using low-level features such as Fourier transforms and MFCCs is an extremely difficult task, as there is little semantic information available. Full semantic transcription of audio is an unreliable and imperfect task in the best case, an unsolved problem in the worst. To this end we propose a robust mid-level representation that incorporates both harmonic and rhythmic information, without attempting full transcription. We describe a process for creating this representation automatically, directly from multi-timbral and polyphonic music signals, with an emphasis on popular music. We also offer various evaluations of our techniques. More so than most approaches working from raw audio, we incorporate musical knowledge into our assumptions, our models, and our processes. Our hope is that by utilizing this notion of a musically-motivated mid-level representation we may help bridge the gap between symbolic and audio research.
KW - Harmonic description
KW - Music similarity
KW - Segmentation
UR - http://www.scopus.com/inward/record.url?scp=84873553947&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84873553947&partnerID=8YFLogxK
M3 - Conference contribution
AN - SCOPUS:84873553947
SN - 9780955117909
T3 - ISMIR 2005 - 6th International Conference on Music Information Retrieval
SP - 304
EP - 311
BT - ISMIR 2005 - 6th International Conference on Music Information Retrieval
T2 - 6th International Conference on Music Information Retrieval, ISMIR 2005
Y2 - 11 September 2005 through 15 September 2005
ER -