TY - GEN
T1 - Feature-rich continuous language models for speech recognition
AU - Mirowski, Piotr
AU - Chopra, Sumit
AU - Balakrishnan, Suhrid
AU - Bangalore, Srinivas
PY - 2010
Y1 - 2010
N2 - State-of-the-art probabilistic models of text such as n-grams require an exponential number of examples as the size of the context grows, a problem that is due to the discrete word representation. We propose to solve this problem by learning a continuous-valued and lowdimensional mapping of words, and base our predictions for the probabilities of the target word on non-linear dynamics of the latent space representation of the words in context window. We build on neural networks-based language models; by expressing them as energy-based models, we can further enrich the models with additional inputs such as part-of-speech tags, topic information and graphs of word similarity. We demonstrate a significantly lower perplexity on different text corpora, as well as improved word accuracy rate on speech recognition tasks, as compared to Kneser-Ney backoff n-gram-based language models.
AB - State-of-the-art probabilistic models of text such as n-grams require an exponential number of examples as the size of the context grows, a problem that is due to the discrete word representation. We propose to solve this problem by learning a continuous-valued and lowdimensional mapping of words, and base our predictions for the probabilities of the target word on non-linear dynamics of the latent space representation of the words in context window. We build on neural networks-based language models; by expressing them as energy-based models, we can further enrich the models with additional inputs such as part-of-speech tags, topic information and graphs of word similarity. We demonstrate a significantly lower perplexity on different text corpora, as well as improved word accuracy rate on speech recognition tasks, as compared to Kneser-Ney backoff n-gram-based language models.
KW - Natural language
KW - Neural networks
KW - Probability
KW - Speech recognition
UR - http://www.scopus.com/inward/record.url?scp=79951776389&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=79951776389&partnerID=8YFLogxK
U2 - 10.1109/SLT.2010.5700858
DO - 10.1109/SLT.2010.5700858
M3 - Conference contribution
AN - SCOPUS:79951776389
SN - 9781424479030
T3 - 2010 IEEE Workshop on Spoken Language Technology, SLT 2010 - Proceedings
SP - 241
EP - 246
BT - 2010 IEEE Workshop on Spoken Language Technology, SLT 2010 - Proceedings
T2 - 2010 IEEE Workshop on Spoken Language Technology, SLT 2010
Y2 - 12 December 2010 through 15 December 2010
ER -