TY - GEN
T1 - Recurrent orthogonal networks and long-memory tasks
AU - Henaff, Mikael
AU - Szlam, Arthur
AU - Lecun, Yann
PY - 2016
Y1 - 2016
N2 - Although RNNs have been shown to be powerful tools for processing sequential data, finding architectures or optimization strategies that allow them to model very long term dependencies is still an active area of research In this work, we carefully analyze two synthetic datasets originally outlined in (Hochreiter & Schmidhuber, 1997) which are used to evaluate the ability of RNNs to store information over many time steps. We explicitly construct RNN solutions to these problems, and using these constructions, illuminate both the problems themselves and the way in which RNNs store different types of information in their hidden states. These constructions fur-thermore explain the success of recent methods that specify unitary initializations or constraints on the transition matrices.
AB - Although RNNs have been shown to be powerful tools for processing sequential data, finding architectures or optimization strategies that allow them to model very long term dependencies is still an active area of research In this work, we carefully analyze two synthetic datasets originally outlined in (Hochreiter & Schmidhuber, 1997) which are used to evaluate the ability of RNNs to store information over many time steps. We explicitly construct RNN solutions to these problems, and using these constructions, illuminate both the problems themselves and the way in which RNNs store different types of information in their hidden states. These constructions fur-thermore explain the success of recent methods that specify unitary initializations or constraints on the transition matrices.
UR - http://www.scopus.com/inward/record.url?scp=84999048318&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84999048318&partnerID=8YFLogxK
M3 - Conference contribution
AN - SCOPUS:84999048318
T3 - 33rd International Conference on Machine Learning, ICML 2016
SP - 2978
EP - 2986
BT - 33rd International Conference on Machine Learning, ICML 2016
A2 - Weinberger, Kilian Q.
A2 - Balcan, Maria Florina
PB - International Machine Learning Society (IMLS)
T2 - 33rd International Conference on Machine Learning, ICML 2016
Y2 - 19 June 2016 through 24 June 2016
ER -