A disambiguation algorithm for weighted automata

Mehryar Mohri, Michael D. Riley

Research output: Contribution to journalArticle

Abstract

We present a disambiguation algorithm for weighted automata. The algorithm admits two main stages: a pre-disambiguation stage followed by a transition removal stage. We give a detailed description of the algorithm and the proof of its correctness. The algorithm is not applicable to all weighted automata but we prove sufficient conditions for its applicability in the case of the tropical semiring by introducing the weak twins property. In particular, the algorithm can be used with any weighted automaton over the tropical semiring for which the weighted determinization algorithm terminates and with any acyclic weighted automaton over an arbitrary weakly left divisible cancellative and commutative semiring. While disambiguation can sometimes be achieved using weighted determinization, our disambiguation algorithm in some cases can return a result that is exponentially smaller than any equivalent deterministic automaton. We also present some empirical evidence of the space benefits of disambiguation over determinization in speech recognition and machine translation applications.

Original languageEnglish (US)
Pages (from-to)53-68
Number of pages16
JournalTheoretical Computer Science
Volume679
DOIs
StatePublished - May 30 2017

Keywords

  • Automata theory
  • Rational power series
  • Weighted automata
  • Weighted automata algorithms

ASJC Scopus subject areas

  • Theoretical Computer Science
  • Computer Science(all)

Fingerprint Dive into the research topics of 'A disambiguation algorithm for weighted automata'. Together they form a unique fingerprint.

  • Cite this