Phonetic name matching for cross-lingual spoken sentence retrieval

Heng Ji, Ralph Grishman, Wen Wang

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

Cross-lingual Spoken Sentence Retrieval (CLSSR) remains a challenge, especially for queries including OOV words such as person names. This paper proposes a simple method of fuzzy matching between query names and phones of candidate audio segments. This approach has the advantage of avoiding some word decoding errors in Automatic Speech Recognition (ASR). Experiments on Mandarin-English CLSSR show that phone-based searching and conventional translation-based searching are complementary. Adding phone matching achieved 26.29% improvement on F-measure over searching on state-of-the-art Machine Translation (MT) output and 8.83% over Entity Translation (ET) output.

Original languageEnglish (US)
Title of host publication2008 IEEE Workshop on Spoken Language Technology, SLT 2008 - Proceedings
Pages281-284
Number of pages4
DOIs
StatePublished - 2008
Event2008 IEEE Workshop on Spoken Language Technology, SLT 2008 - Goa, India
Duration: Dec 15 2008Dec 19 2008

Publication series

Name2008 IEEE Workshop on Spoken Language Technology, SLT 2008 - Proceedings

Other

Other2008 IEEE Workshop on Spoken Language Technology, SLT 2008
Country/TerritoryIndia
CityGoa
Period12/15/0812/19/08

Keywords

  • Speech Recognition, Information Retrieval

ASJC Scopus subject areas

  • Language and Linguistics
  • Software
  • Electrical and Electronic Engineering
  • Communication

Fingerprint

Dive into the research topics of 'Phonetic name matching for cross-lingual spoken sentence retrieval'. Together they form a unique fingerprint.

Cite this