Abstract
This paper considers how to separate text and/or graphics from smooth background in screen content and mixed document images and proposes two approaches to perform this segmentation task. The proposed methods make use of the fact that the background in each block is usually smoothly varying and can be modeled well by a linear combination of a few smoothly varying basis functions, while the foreground text and graphics create sharp discontinuity. The algorithms separate the background and foreground pixels by trying to fit background pixel values in the block into a smooth function using two different schemes. One is based on robust regression, where the inlier pixels will be considered as background, while remaining outlier pixels will be considered foreground. The second approach uses a sparse decomposition framework where the background and foreground layers are modeled with a smooth and sparse components respectively. These algorithms have been tested on images extracted from HEVC standard test sequences for screen content coding, and are shown to have superior performance over previous approaches. The proposed methods can be used in different applications such as text extraction, separate coding of background and foreground for compression of screen content, and medical image segmentation.
Original language | English (US) |
---|---|
Article number | 7542596 |
Pages (from-to) | 573-584 |
Number of pages | 12 |
Journal | IEEE Journal on Emerging and Selected Topics in Circuits and Systems |
Volume | 6 |
Issue number | 4 |
DOIs | |
State | Published - Dec 2016 |
Keywords
- Alternating direction method of multipliers (ADMM)
- image segmentation
- random sample consensus (RANSAC)
- robust regression
- sparse decomposition
ASJC Scopus subject areas
- Electrical and Electronic Engineering