Monotonic Simultaneous Translation with Chunk-wise Reordering and Refinement

Hyojung Han, Seokchan Ahn, Yoonjung Choi, Insoo Chung, Sangha Kim, Kyunghyun Cho

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

Recent work in simultaneous machine translation is often trained with conventional full sentence translation corpora, leading to either excessive latency or necessity to anticipate as-yet-unarrived words, when dealing with a language pair whose word orders significantly differ. This is unlike human simultaneous interpreters who produce largely monotonic translations at the expense of the grammaticality of a sentence being translated. In this paper, we thus propose an algorithm to reorder and refine the target side of a full sentence translation corpus, so that the words/phrases between the source and target sentences are aligned largely monotonically, using word alignment and non-autoregressive neural machine translation. We then train a widely used wait-k simultaneous translation model on this reordered- and-refined corpus. The proposed approach improves BLEU scores and resulting translations exhibit enhanced monotonicity with source sentences.

Original languageEnglish (US)
Title of host publicationWMT 2021 - 6th Conference on Machine Translation, Proceedings
PublisherAssociation for Computational Linguistics (ACL)
Pages1110-1123
Number of pages14
ISBN (Electronic)9781954085947
StatePublished - 2021
Event6th Conference on Machine Translation, WMT 2021 - Virtual, Online, Dominican Republic
Duration: Nov 10 2021Nov 11 2021

Publication series

NameWMT 2021 - 6th Conference on Machine Translation, Proceedings

Conference

Conference6th Conference on Machine Translation, WMT 2021
Country/TerritoryDominican Republic
CityVirtual, Online
Period11/10/2111/11/21

ASJC Scopus subject areas

  • Language and Linguistics
  • Human-Computer Interaction
  • Software

Fingerprint

Dive into the research topics of 'Monotonic Simultaneous Translation with Chunk-wise Reordering and Refinement'. Together they form a unique fingerprint.

Cite this