TY - JOUR
T1 - Multi-way, multilingual neural machine translation
AU - Firat, Orhan
AU - Cho, Kyunghyun
AU - Sankaran, Baskaran
AU - Yarman Vural, Fatos T.
AU - Bengio, Yoshua
N1 - Publisher Copyright:
© 2017 Elsevier Ltd
PY - 2017/9
Y1 - 2017/9
N2 - We propose multi-way, multilingual neural machine translation. The proposed approach enables a single neural translation model to translate between multiple languages, with a number of parameters that grows only linearly with the number of languages. This is made possible by having a single attention mechanism that is shared across all language pairs. We train the proposed multi-way, multilingual model on ten language pairs from WMT′15 simultaneously and observe clear performance improvements over models trained on only one language pair. We empirically evaluate the proposed model on low-resource language translation tasks. In particular, we observe that the proposed multilingual model outperforms strong conventional statistical machine translation systems on Turkish-English and Uzbek-English by incorporating the resources of other language pairs.
AB - We propose multi-way, multilingual neural machine translation. The proposed approach enables a single neural translation model to translate between multiple languages, with a number of parameters that grows only linearly with the number of languages. This is made possible by having a single attention mechanism that is shared across all language pairs. We train the proposed multi-way, multilingual model on ten language pairs from WMT′15 simultaneously and observe clear performance improvements over models trained on only one language pair. We empirically evaluate the proposed model on low-resource language translation tasks. In particular, we observe that the proposed multilingual model outperforms strong conventional statistical machine translation systems on Turkish-English and Uzbek-English by incorporating the resources of other language pairs.
KW - Low resource translation
KW - Multi-lingual
KW - Neural machine translation
UR - http://www.scopus.com/inward/record.url?scp=85008205728&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85008205728&partnerID=8YFLogxK
U2 - 10.1016/j.csl.2016.10.006
DO - 10.1016/j.csl.2016.10.006
M3 - Article
AN - SCOPUS:85008205728
SN - 0885-2308
VL - 45
SP - 236
EP - 252
JO - Computer Speech and Language
JF - Computer Speech and Language
ER -