TY - JOUR
T1 - Measuring musical rhythm similarity
T2 - Statistical features versus transformation methods
AU - Beltran, Juan F.
AU - Liu, Xiaohua
AU - Mohanchandra, Nishant
AU - Toussaint, Godfried T.
N1 - Funding Information:
This research was supported by a grant from the Provost's O±ce of New York University Abu Dhabi, through the Faculty of Science, in Abu Dhabi, The United Arab Emirates. Preliminary work was done in the fall of 2012 in Abu Dhabi, and the
Publisher Copyright:
© 2015 World Scientific Publishing Company.
PY - 2015/3/25
Y1 - 2015/3/25
N2 - Two approaches to measuring the similarity between symbolically notated musical rhythms are compared with each other and with human judgments of perceived similarity. The first is the edit-distance, a popular transformation method, applied to the symbolic rhythm sequences. The second approach employs the histograms of the inter-onset-intervals (IOIs) calculated from the rhythms. Furthermore, two methods for dealing with the histograms are also compared. The first utilizes the Mallows distance, a transformation method akin to the Earth-Movers distance popular in computer vision, and the second extracts a group of standard statistical features, used in music information retrieval, from the IOI-histograms. The measures are compared using four contrastive musical rhythm data sets by means of statistical Mantel tests that compute correlation coefficients between the various dissimilarity matrices. The results provide evidence from the aural domain, that transformation methods such as the edit distance are superior to feature-based methods for predicting human judgments of similarity. The evidence also supports the hypothesis that IOI-histogram-based methods are better than music-theoretical structural features computed from the rhythms themselves, provided that the rhythms do not share identical IOI histograms.
AB - Two approaches to measuring the similarity between symbolically notated musical rhythms are compared with each other and with human judgments of perceived similarity. The first is the edit-distance, a popular transformation method, applied to the symbolic rhythm sequences. The second approach employs the histograms of the inter-onset-intervals (IOIs) calculated from the rhythms. Furthermore, two methods for dealing with the histograms are also compared. The first utilizes the Mallows distance, a transformation method akin to the Earth-Movers distance popular in computer vision, and the second extracts a group of standard statistical features, used in music information retrieval, from the IOI-histograms. The measures are compared using four contrastive musical rhythm data sets by means of statistical Mantel tests that compute correlation coefficients between the various dissimilarity matrices. The results provide evidence from the aural domain, that transformation methods such as the edit distance are superior to feature-based methods for predicting human judgments of similarity. The evidence also supports the hypothesis that IOI-histogram-based methods are better than music-theoretical structural features computed from the rhythms themselves, provided that the rhythms do not share identical IOI histograms.
KW - Mantel test
KW - Musical rhythm
KW - edit distance
KW - inter-onset interval histograms
KW - mallows distance
KW - music information retrieval
KW - similarity measures
KW - symbolic music notation
KW - transformations
UR - http://www.scopus.com/inward/record.url?scp=84928487225&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84928487225&partnerID=8YFLogxK
U2 - 10.1142/S0218001415500093
DO - 10.1142/S0218001415500093
M3 - Article
AN - SCOPUS:84928487225
SN - 0218-0014
VL - 29
JO - International Journal of Pattern Recognition and Artificial Intelligence
JF - International Journal of Pattern Recognition and Artificial Intelligence
IS - 2
M1 - 1550009
ER -