A study of using an Out-Of-Box commercial MT system for query translation in CLIR

Dan Wu, Daqing He, Heng Ji, Ralph Grishman

Research output: Chapter in Book/Report/Conference proceeding > Conference contribution

Abstract

The recent availability of commercial online machine translation (MT) systems makes it possible for lay Web users to apply MT to cross-language information retrieval (CLIR). To study the effectiveness of MT for query translation, we conducted a set of experiments using Google Translate, an online MT system provided by Google, to translate queries in CLIR. The experiments show that MT is an excellent tool for the query translation task and that, with the help of relevance feedback, it can achieve significant improvement over the monolingual baseline. MT-based query translation not only works for long queries but is also effective for short Web queries.

Original language: English (US)
Title of host publication: Proceeding of the 2nd ACM Workshop on Improving Non English Web Searching, iNEWS'08, Co-located with the 17th ACM Conference on Information and Knowledge Management, CIKM 2008
Pages: 71-76
Number of pages: 6
State: Published - 2008
Event: 2nd ACM Workshop on Improving Non English Web Searching, iNEWS'08, Co-located with the 17th ACM Conference on Information and Knowledge Management, CIKM 2008 - Napa Valley, CA, United States
Duration: Oct 30 2008 to Oct 30 2008

Publication series

Name: International Conference on Information and Knowledge Management, Proceedings

Keywords

  • Cross-language information retrieval
  • Machine translation
  • Query expansion
  • Query translation
  • Relevance feedback

ASJC Scopus subject areas

  • General Decision Sciences
  • General Business, Management and Accounting
