TY - GEN
T1 - Four timely insights on automatic chord estimation
AU - Humphrey, Eric J.
AU - Bello, Juan P.
N1 - Publisher Copyright:
© Eric J. Humphrey, Juan P. Bello.
PY - 2015
Y1 - 2015
N2 - Automatic chord estimation (ACE) is a hallmark research topic in content-based music informatics, but like many other tasks, system performance appears to be converging to yet another glass ceiling. Looking toward trends in other machine perception domains, one might conclude that complex, data-driven methods have the potential to significantly advance the state of the art. Two recent efforts did exactly this for large-vocabulary ACE, but despite arguably achieving some of the highest results to date, both approaches plateau well short of having solved the problem. Therefore, this work explores the behavior of these two high performing, systems as a means of understanding obstacles and limitations in chord estimation, arriving at four critical observations: one, music recordings that invalidate tacit assumptions about harmony and tonality result in erroneous and even misleading performance; two, standard lexicons and comparison methods struggle to reflect the natural relationships between chords; three, conventional approaches conflate the competing goals of recognition and transcription to some undefined degree; and four, the perception of chords in real music can be highly subjective, making the very notion of “ground truth” annotations tenuous. Synthesizing these observations, this paper offers possible remedies going forward, and concludes with some perspectives on the future of both ACE research and the field at large.
AB - Automatic chord estimation (ACE) is a hallmark research topic in content-based music informatics, but like many other tasks, system performance appears to be converging to yet another glass ceiling. Looking toward trends in other machine perception domains, one might conclude that complex, data-driven methods have the potential to significantly advance the state of the art. Two recent efforts did exactly this for large-vocabulary ACE, but despite arguably achieving some of the highest results to date, both approaches plateau well short of having solved the problem. Therefore, this work explores the behavior of these two high performing, systems as a means of understanding obstacles and limitations in chord estimation, arriving at four critical observations: one, music recordings that invalidate tacit assumptions about harmony and tonality result in erroneous and even misleading performance; two, standard lexicons and comparison methods struggle to reflect the natural relationships between chords; three, conventional approaches conflate the competing goals of recognition and transcription to some undefined degree; and four, the perception of chords in real music can be highly subjective, making the very notion of “ground truth” annotations tenuous. Synthesizing these observations, this paper offers possible remedies going forward, and concludes with some perspectives on the future of both ACE research and the field at large.
UR - http://www.scopus.com/inward/record.url?scp=85066087545&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85066087545&partnerID=8YFLogxK
M3 - Conference contribution
AN - SCOPUS:85066087545
T3 - Proceedings of the 16th International Society for Music Information Retrieval Conference, ISMIR 2015
SP - 673
EP - 679
BT - Proceedings of the 16th International Society for Music Information Retrieval Conference, ISMIR 2015
A2 - Muller, Meinard
A2 - Wiering, Frans
PB - International Society for Music Information Retrieval
T2 - 16th International Society for Music Information Retrieval Conference, ISMIR 2015
Y2 - 26 October 2015 through 30 October 2015
ER -