TY - JOUR
T1 - Measuring structural similarity in music
AU - Bello, Juan P.
N1 - Funding Information:
Manuscript received July 12, 2010; revised November 19, 2010; accepted January 04, 2011. Date of publication February 10, 2011; date of current version July 20, 2011. This material is based upon work supported by the National Science Foundation under Grant IIS-0844654 and by the U.S. Institute of Museum and Library Services under Grant LG-06-08-0073-08. The associate editor coordinating the review of this manuscript and approving it for publication was Prof. Bryan Pardo.
PY - 2011
Y1 - 2011
N2 - This paper presents a novel method for measuring the structural similarity between music recordings. It uses recurrence plot analysis to characterize patterns of repetition in the feature sequence, and the normalized compression distance, a practical approximation of the joint Kolmogorov complexity, to measure the pairwise similarity between the plots. By measuring the distance between intermediate representations of signal structure, the proposed method departs from common approaches to music structure analysis which assume a block-based model of music, and thus concentrate on segmenting and clustering sections. The approach ensures that global structure is consistently and robustly characterized in the presence of tempo, instrumentation, and key changes, while the used metric provides a simple to compute, versatile and robust alternative to common approaches in music similarity research. Finally, experimental results demonstrate success at characterizing similarity, while contributing an optimal parameterization of the proposed approach.
AB - This paper presents a novel method for measuring the structural similarity between music recordings. It uses recurrence plot analysis to characterize patterns of repetition in the feature sequence, and the normalized compression distance, a practical approximation of the joint Kolmogorov complexity, to measure the pairwise similarity between the plots. By measuring the distance between intermediate representations of signal structure, the proposed method departs from common approaches to music structure analysis which assume a block-based model of music, and thus concentrate on segmenting and clustering sections. The approach ensures that global structure is consistently and robustly characterized in the presence of tempo, instrumentation, and key changes, while the used metric provides a simple to compute, versatile and robust alternative to common approaches in music similarity research. Finally, experimental results demonstrate success at characterizing similarity, while contributing an optimal parameterization of the proposed approach.
KW - Audio signal processing
KW - computer audition
KW - music information retrieval (MIR)
KW - music structure analysis
KW - sound similarity
UR - http://www.scopus.com/inward/record.url?scp=79960681282&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=79960681282&partnerID=8YFLogxK
U2 - 10.1109/TASL.2011.2108287
DO - 10.1109/TASL.2011.2108287
M3 - Article
AN - SCOPUS:79960681282
SN - 1558-7916
VL - 19
SP - 2013
EP - 2025
JO - IEEE Transactions on Audio, Speech and Language Processing
JF - IEEE Transactions on Audio, Speech and Language Processing
IS - 7
M1 - 5711645
ER -