TY - GEN
T1 - Fuzzy Syntactic Reordering for Phrase-based Statistical Machine Translation
AU - Andreas, Jacob
AU - Habash, Nizar
AU - Rambow, Owen
N1 - Publisher Copyright: © 2011 Association for Computational Linguistics
PY - 2011
Y1 - 2011
AB - The quality of Arabic-English statistical machine translation often suffers as a result of standard phrase-based SMT systems' inability to perform long-range re-orderings, specifically those needed to translate VSO-ordered Arabic sentences. This problem is further exacerbated by the low performance of Arabic parsers on subject and subject span detection. In this paper, we present two parse “fuzzification” techniques which allow the translation system to select among a range of possible S-V re-orderings. With this approach, we demonstrate a 0.3-point improvement in BLEU score (69% of the maximum possible using gold parses), and a corresponding improvement in the percentage of syntactically well-formed subjects under a manual evaluation.
UR - http://www.scopus.com/inward/record.url?scp=84878177035&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84878177035&partnerID=8YFLogxK
M3 - Conference contribution
AN - SCOPUS:84878177035
T3 - WMT 2011 - 6th Workshop on Statistical Machine Translation, Proceedings of the Workshop
SP - 227
EP - 236
BT - WMT 2011 - 6th Workshop on Statistical Machine Translation, Proceedings of the Workshop
PB - Association for Computational Linguistics (ACL)
T2 - 6th Workshop on Statistical Machine Translation, WMT 2011
Y2 - 30 July 2011 through 31 July 2011
ER -