TY - GEN
T1 - Expected sequence similarity maximization
AU - Allauzen, Cyril
AU - Kumar, Shankar
AU - Macherey, Wolfgang
AU - Mohri, Mehryar
AU - Riley, Michael
PY - 2010
Y1 - 2010
N2 - This paper presents efficient algorithms for expected similarity maximization, which coincides with minimum Bayes decoding for a similarity-based loss function. Our algorithms are designed for similarity functions that are sequence kernels in a general class of positive definite symmetric kernels. We discuss both a general algorithm and a more efficient algorithm applicable in a common unambiguous scenario. We also describe the application of our algorithms to machine translation and report the results of experiments with several translation data sets which demonstrate a substantial speed-up. In particular, our results show a speed-up by two orders of magnitude with respect to the original method of Tromble et al. (2008) and by a factor of 3 or more even with respect to an approximate algorithm specifically designed for that task. These results open the path for the exploration of more appropriate or optimal kernels for the specific tasks considered.
AB - This paper presents efficient algorithms for expected similarity maximization, which coincides with minimum Bayes decoding for a similarity-based loss function. Our algorithms are designed for similarity functions that are sequence kernels in a general class of positive definite symmetric kernels. We discuss both a general algorithm and a more efficient algorithm applicable in a common unambiguous scenario. We also describe the application of our algorithms to machine translation and report the results of experiments with several translation data sets which demonstrate a substantial speed-up. In particular, our results show a speed-up by two orders of magnitude with respect to the original method of Tromble et al. (2008) and by a factor of 3 or more even with respect to an approximate algorithm specifically designed for that task. These results open the path for the exploration of more appropriate or optimal kernels for the specific tasks considered.
UR - http://www.scopus.com/inward/record.url?scp=84863391321&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84863391321&partnerID=8YFLogxK
M3 - Conference contribution
AN - SCOPUS:84863391321
SN - 1932432655
SN - 9781932432657
T3 - NAACL HLT 2010 - Human Language Technologies: The 2010 Annual Conference of the North American Chapter of the Association for Computational Linguistics, Proceedings of the Main Conference
SP - 957
EP - 965
BT - NAACL HLT 2010 - Human Language Technologies
T2 - 2010 Human Language Technologies Conference ofthe North American Chapter of the Association for Computational Linguistics, NAACL HLT 2010
Y2 - 2 June 2010 through 4 June 2010
ER -