Abstract
Given two sequences X and Y that are strings over some alphabet set, we consider the distance d(X,Y) between them defined to be minimum number of character replacements and block (substring) reversals needed to transform X to Y (or vice versa). The operations are required to be disjoint. This is the "simplest" sequence comparison problem we know of that allows natural block edit operations. Block reversals arise naturally in genomic sequence comparison; they are also of interest in matching music data. We present an algorithm for exactly computing the distance d(X,Y); it takes time O(|X|log 2|X|), and hence, is near-linear. Trivial approach takes quadratic time.
Original language | English (US) |
---|---|
Pages (from-to) | 95-101 |
Number of pages | 7 |
Journal | Theoretical Computer Science |
Volume | 321 |
Issue number | 1 |
DOIs | |
State | Published - Jun 16 2004 |
Event | Latin American Theoretical Informatics - Cancun, Mexico Duration: Apr 3 2002 → Apr 6 2002 |
Keywords
- Block edit distance
- Sequence comparison
- String periodicity
ASJC Scopus subject areas
- Theoretical Computer Science
- General Computer Science