Learning Non-Monotonic Automatic Post-Editing of Translations from Human Orderings

Antonio Ǵois, Kyunghyun Cho, Andŕe Martins

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

Recent research in neural machine translation has explored flexible generation orders, as an alternative to left-to-right generation. However, training non-monotonic models brings a new complication: how to search for a good ordering when there is a combinatorial explosion of orderings arriving at the same final result? Also, how do these automatic orderings compare with the actual behaviour of human translators? Current models rely on manually built biases or are left to explore all possibilities on their own. In this paper, we analyze the orderings produced by human post-editors and use them to train an automatic postediting system. We compare the resulting system with those trained with left-to-right and random post-editing orderings. We observe that humans tend to follow a nearly left-to-right order, but with interesting deviations, such as preferring to start by correcting punctuation or verbs.

Original languageEnglish (US)
Title of host publicationProceedings of the 22nd Annual Conference of the European Association for Machine Translation, EAMT 2020
EditorsAndre Martins, Helena Moniz, Sara Fumega, Bruno Martins, Fernando Batista, Luisa Coheur, Carla Parra Escartiin, Isabel Trancoso, Marco Turchi, Arianna Bisazza, Joss Moorkens, Ana Guerberof, Mary Nurminen, Lena Marg, Mikel L. Forcada
PublisherEuropean Association for Machine Translation
Pages205-214
Number of pages10
ISBN (Electronic)9789893305898
StatePublished - 2020
Event22nd Annual Conference of the European Association for Machine Translation, EAMT 2020 - Virtual, Lisbon, Portugal
Duration: Nov 3 2020Nov 5 2020

Publication series

NameProceedings of the 22nd Annual Conference of the European Association for Machine Translation, EAMT 2020

Conference

Conference22nd Annual Conference of the European Association for Machine Translation, EAMT 2020
Country/TerritoryPortugal
CityVirtual, Lisbon
Period11/3/2011/5/20

ASJC Scopus subject areas

  • Human-Computer Interaction
  • Language and Linguistics
  • Software

Fingerprint

Dive into the research topics of 'Learning Non-Monotonic Automatic Post-Editing of Translations from Human Orderings'. Together they form a unique fingerprint.

Cite this