A two-level approach for subtitle alignment

Jia Huang, Hao Ding, Xiaohua Hu, Yong Liu

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

In this paper, we propose a two-level Needleman-Wunsch algorithm to align two subtitle files. We consider each subtitle file as a sequence of sentences, and each sentence as a sequence of characters. Our algorithm aligns the OCR and Web subtitles from both sentence level and character level. Experiments on ten datasets from two TV shows indicate that our algorithm outperforms the state-of-the-art approaches with an average precision and recall of 0.96 and 0.95.

Original languageEnglish (US)
Title of host publicationAdvances in Information Retrieval - 36th European Conference on IR Research, ECIR 2014, Proceedings
PublisherSpringer Verlag
Pages468-473
Number of pages6
ISBN (Print)9783319060279
DOIs
StatePublished - 2014
Event36th European Conference on Information Retrieval, ECIR 2014 - Amsterdam, Netherlands
Duration: Apr 13 2014Apr 16 2014

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume8416 LNCS
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Other

Other36th European Conference on Information Retrieval, ECIR 2014
Country/TerritoryNetherlands
CityAmsterdam
Period4/13/144/16/14

Keywords

  • dynamic programming
  • sequence alignment
  • subtitle alignment

ASJC Scopus subject areas

  • Theoretical Computer Science
  • General Computer Science

Fingerprint

Dive into the research topics of 'A two-level approach for subtitle alignment'. Together they form a unique fingerprint.

Cite this