TY - GEN
T1 - Downbeat tracking with multiple features and deep neural networks
AU - Durand, Simon
AU - Bello, Juan P.
AU - David, Bertrand
AU - Richard, Gael
N1 - Publisher Copyright:
© 2015 IEEE.
PY - 2015/8/4
Y1 - 2015/8/4
N2 - In this paper, we introduce a novel method for the automatic estimation of downbeat positions from music signals. Our system relies on the computation of musically inspired features capturing important aspects of music such as timbre, harmony, rhythmic patterns, or local similarities in both timbre and harmony. It then uses several independent deep neural networks to learn higher-level representations. The downbeat sequences are finally obtained thanks to a temporal decoding step based on the Viterbi algorithm. The comparative evaluation conducted on varied datasets demonstrates the efficiency and robustness across different music styles of our approach.
AB - In this paper, we introduce a novel method for the automatic estimation of downbeat positions from music signals. Our system relies on the computation of musically inspired features capturing important aspects of music such as timbre, harmony, rhythmic patterns, or local similarities in both timbre and harmony. It then uses several independent deep neural networks to learn higher-level representations. The downbeat sequences are finally obtained thanks to a temporal decoding step based on the Viterbi algorithm. The comparative evaluation conducted on varied datasets demonstrates the efficiency and robustness across different music styles of our approach.
KW - Deep Networks
KW - Downbeat Tracking
KW - Music Information Retrieval
KW - Music Signal Processing
UR - http://www.scopus.com/inward/record.url?scp=84946023293&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84946023293&partnerID=8YFLogxK
U2 - 10.1109/ICASSP.2015.7178001
DO - 10.1109/ICASSP.2015.7178001
M3 - Conference contribution
AN - SCOPUS:84946023293
T3 - ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings
SP - 409
EP - 413
BT - 2015 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2015 - Proceedings
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 40th IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2015
Y2 - 19 April 2014 through 24 April 2014
ER -