TY - GEN
T1 - 3-Way composition of weighted finite-state transducers
AU - Allauzen, Cyril
AU - Mohri, Mehryar
N1 - Copyright:
Copyright 2008 Elsevier B.V., All rights reserved.
PY - 2008
Y1 - 2008
N2 - Composition of weighted transducers is a fundamental algorithm used in many applications, including for computing complex edit-distances between automata, or string kernels in machine learning, or to combine different components of a speech recognition, speech synthesis, or information extraction system. We present a generalization of the composition of weighted transducers, 3-way composition, which is dramatically faster in practice than the standard composition algorithm when combining more than two transducers. The worst-case complexity of our algorithm for composing three transducers T 1, T 2, and T 3 resulting in T, is O(|T| Q min (d(T 1) d(T 3), d(T 2))∈+∈|T| E ), where |•| Q denotes the number of states, |•| E the number of transitions, and d(•) the maximum out-degree. As in regular composition, the use of perfect hashing requires a pre-processing step with linear-time expected complexity in the size of the input transducers. In many cases, this approach significantly improves on the complexity of standard composition. Our algorithm also leads to a dramatically faster composition in practice. Furthermore, standard composition can be obtained as a special case of our algorithm. We report the results of several experiments demonstrating this improvement. These theoretical and empirical improvements significantly enhance performance in the applications already mentioned.
AB - Composition of weighted transducers is a fundamental algorithm used in many applications, including for computing complex edit-distances between automata, or string kernels in machine learning, or to combine different components of a speech recognition, speech synthesis, or information extraction system. We present a generalization of the composition of weighted transducers, 3-way composition, which is dramatically faster in practice than the standard composition algorithm when combining more than two transducers. The worst-case complexity of our algorithm for composing three transducers T 1, T 2, and T 3 resulting in T, is O(|T| Q min (d(T 1) d(T 3), d(T 2))∈+∈|T| E ), where |•| Q denotes the number of states, |•| E the number of transitions, and d(•) the maximum out-degree. As in regular composition, the use of perfect hashing requires a pre-processing step with linear-time expected complexity in the size of the input transducers. In many cases, this approach significantly improves on the complexity of standard composition. Our algorithm also leads to a dramatically faster composition in practice. Furthermore, standard composition can be obtained as a special case of our algorithm. We report the results of several experiments demonstrating this improvement. These theoretical and empirical improvements significantly enhance performance in the applications already mentioned.
UR - http://www.scopus.com/inward/record.url?scp=50249178637&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=50249178637&partnerID=8YFLogxK
U2 - 10.1007/978-3-540-70844-5_27
DO - 10.1007/978-3-540-70844-5_27
M3 - Conference contribution
AN - SCOPUS:50249178637
SN - 354070843X
SN - 9783540708438
T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
SP - 262
EP - 273
BT - Implementation and Application of Automata - 13th International Conference, CIAA 2008, Proceedings
T2 - 13th International Conference on Implementation and Application of Automata, CIAA 2008
Y2 - 21 July 2008 through 24 July 2008
ER -