TY - GEN
T1 - Convolutional recurrent neural networks for music classification
AU - Choi, Keunwoo
AU - Fazekas, Gyorgy
AU - Sandler, Mark
AU - Cho, Kyunghyun
N1 - Publisher Copyright:
© 2017 IEEE.
PY - 2017/6/16
Y1 - 2017/6/16
N2 - We introduce a convolutional recurrent neural network (CRNN) for music tagging. CRNNs take advantage of convolutional neural networks (CNNs) for local feature extraction and recurrent neural networks for temporal summarisation of the extracted features. We compare CRNN with three CNN structures that have been used for music tagging while controlling the number of parameters with respect to their performance and training time per sample. Overall, we found that CRNNs show a strong performance with respect to the number of parameter and training time, indicating the effectiveness of its hybrid structure in music feature extraction and feature summarisation.
AB - We introduce a convolutional recurrent neural network (CRNN) for music tagging. CRNNs take advantage of convolutional neural networks (CNNs) for local feature extraction and recurrent neural networks for temporal summarisation of the extracted features. We compare CRNN with three CNN structures that have been used for music tagging while controlling the number of parameters with respect to their performance and training time per sample. Overall, we found that CRNNs show a strong performance with respect to the number of parameter and training time, indicating the effectiveness of its hybrid structure in music feature extraction and feature summarisation.
KW - convolutional neural networks
KW - music classification
KW - recurrent neural networks
UR - http://www.scopus.com/inward/record.url?scp=85023756452&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85023756452&partnerID=8YFLogxK
U2 - 10.1109/ICASSP.2017.7952585
DO - 10.1109/ICASSP.2017.7952585
M3 - Conference contribution
AN - SCOPUS:85023756452
T3 - ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings
SP - 2392
EP - 2396
BT - 2017 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2017 - Proceedings
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 2017 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2017
Y2 - 5 March 2017 through 9 March 2017
ER -