Challenges in building an Arabic-English GHMT system with SMT components

Nizar Habash, Bonnie Dorr, Christof Monz

Research output: Contribution to conferencePaperpeer-review

Abstract

The research context of this paper is developing hybrid machine translation (MT) systems that exploit the advantages of linguistic rule-based and statistical MT systems. Arabic, as a morphologically rich language, is especially challenging even without addressing the hybridization question. In this paper, we describe the challenges in building an Arabic- English generation-heavy machine translation (GHMT) system and boosting it with statistical machine translation (SMT) components. We present an extensive evaluation of multiple system variants and report positive results on the advantages of hybridization.

Original languageEnglish (US)
Pages56-65
Number of pages10
StatePublished - 2006
Event7th Biennial Conference of the Association for Machine Translation in the Americas, AMTA 2006 - Cambridge, MA, United States
Duration: Aug 8 2006Aug 12 2006

Other

Other7th Biennial Conference of the Association for Machine Translation in the Americas, AMTA 2006
Country/TerritoryUnited States
CityCambridge, MA
Period8/8/068/12/06

ASJC Scopus subject areas

  • Language and Linguistics
  • Human-Computer Interaction
  • Software

Fingerprint

Dive into the research topics of 'Challenges in building an Arabic-English GHMT system with SMT components'. Together they form a unique fingerprint.

Cite this