TY - GEN
T1 - Using shallow syntax information to improve word alignment and reordering for SMT
AU - Crego, Josep M.
AU - Habash, Nizar
N1 - Funding Information:
The first author has been partially funded by the Spanish Government under the AVIVAVOZ project (TEC2006-13694-C03) the Catalan Government under BE-2007 grant and the Universitat Politècnica de Catalunya under UPC-RECERCA grant. The second author was funded under the DARPA GALE program, contract HR0011-06-C-0023.
Publisher Copyright:
© 2008 Association for Computational Linguistics
PY - 2008
Y1 - 2008
N2 - We describe two methods to improve SMT accuracy using shallow syntax information. First, we use chunks to refine the set of word alignments typically used as a starting point in SMT systems. Second, we extend an N -gram-based SMT system with chunk tags to better account for long-distance reorderings. Experiments are reported on an Arabic-English task showing significant improvements. A human error analysis indicates that long-distance reorderings are captured effectively.
AB - We describe two methods to improve SMT accuracy using shallow syntax information. First, we use chunks to refine the set of word alignments typically used as a starting point in SMT systems. Second, we extend an N -gram-based SMT system with chunk tags to better account for long-distance reorderings. Experiments are reported on an Arabic-English task showing significant improvements. A human error analysis indicates that long-distance reorderings are captured effectively.
UR - http://www.scopus.com/inward/record.url?scp=84857788222&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84857788222&partnerID=8YFLogxK
M3 - Conference contribution
AN - SCOPUS:84857788222
T3 - 3rd Workshop on Statistical Machine Translation, WMT 2008 at the Annual Meeting of the Association for Computational Linguistics, ACL 2008
SP - 53
EP - 61
BT - 3rd Workshop on Statistical Machine Translation, WMT 2008 at the Annual Meeting of the Association for Computational Linguistics, ACL 2008
PB - Association for Computational Linguistics (ACL)
T2 - 3rd Workshop on Statistical Machine Translation, WMT 2008 at the Annual Meeting of the Association for Computational Linguistics, ACL 2008
Y2 - 19 June 2008
ER -