TY - GEN
T1 - Generating sentences from a continuous space
AU - Bowman, Samuel R.
AU - Vilnis, Luke
AU - Vinyals, Oriol
AU - Dai, Andrew M.
AU - Jozefowicz, Rafal
AU - Bengio, Samy
PY - 2016/1/1
Y1 - 2016/1/1
N2 - The standard recurrent neural network language model (RNNLM) generates sentences one word at a time and does not work from an explicit global sentence representation. In this work, we introduce and study an RNN-based variational autoencoder generative model that incorporates distributed latent representations of entire sentences. This factorization allows it to explicitly model holistic properties of sentences such as style, topic, and high-level syntactic features. Samples from the prior over these sentence representations remarkably produce diverse and well-formed sentences through simple deterministic decoding. By examining paths through this latent space, we are able to generate coherent novel sentences that interpolate between known sentences. We present techniques for solving the difficult learning problem presented by this model, demonstrate its effectiveness in imputing missing words, explore many interesting properties of the model's latent sentence space, and present negative results on the use of the model in language modeling.
UR - http://www.scopus.com/inward/record.url?scp=85072753030&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85072753030&partnerID=8YFLogxK
M3 - Conference contribution
T3 - CoNLL 2016 - 20th SIGNLL Conference on Computational Natural Language Learning, Proceedings
SP - 10
EP - 21
BT - CoNLL 2016 - 20th SIGNLL Conference on Computational Natural Language Learning, Proceedings
PB - Association for Computational Linguistics (ACL)
T2 - 20th SIGNLL Conference on Computational Natural Language Learning, CoNLL 2016
Y2 - 11 August 2016 through 12 August 2016
ER -