TY - GEN
T1 - A parallel corpus for evaluating machine translation between Arabic and european languages
AU - Habash, Nizar
AU - Zalmout, Nasser
AU - Taji, Dima
AU - Hieu, Hoang
AU - Alzate, Maverick
N1 - Publisher Copyright:
© 2017 Association for Computational Linguistics.
PY - 2017
Y1 - 2017
N2 - We present Arab-Acquis, a large publicly available dataset for evaluating machine translation between 22 European languages and Arabic. Arab-Acquis consists of over 12,000 sentences from the JRCAcquis (Acquis Communautaire) corpus translated twice by professional translators, once from English and once from French, and totaling over 600,000 words. The corpus follows previous data splits in the literature for tuning, development, and testing. We describe the corpus and how it was created. We also present the first benchmarking results on translating to and from Arabic for 22 European languages.
AB - We present Arab-Acquis, a large publicly available dataset for evaluating machine translation between 22 European languages and Arabic. Arab-Acquis consists of over 12,000 sentences from the JRCAcquis (Acquis Communautaire) corpus translated twice by professional translators, once from English and once from French, and totaling over 600,000 words. The corpus follows previous data splits in the literature for tuning, development, and testing. We describe the corpus and how it was created. We also present the first benchmarking results on translating to and from Arabic for 22 European languages.
UR - http://www.scopus.com/inward/record.url?scp=85021665456&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85021665456&partnerID=8YFLogxK
U2 - 10.18653/v1/e17-2038
DO - 10.18653/v1/e17-2038
M3 - Conference contribution
AN - SCOPUS:85021665456
T3 - 15th Conference of the European Chapter of the Association for Computational Linguistics, EACL 2017 - Proceedings of Conference
SP - 235
EP - 241
BT - Short Papers
PB - Association for Computational Linguistics (ACL)
T2 - 15th Conference of the European Chapter of the Association for Computational Linguistics, EACL 2017
Y2 - 3 April 2017 through 7 April 2017
ER -