JPEG-matched MRC compression of compound documents

D. Mukherjee, N. Memon, A. Said

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

Mixed Raster Content (MRC) is an ITU document compression standard (T.44) specifying both a model for multilayer representation of a compound document, and a set of allowable standardized coders for the individual layers. The model requires decomposition of a document into two image layers and a binary mask layer, but the standard does not recommend any procedure for this task. For best compression results, the decomposition method should be optimized for the layer encoders. In this paper, a high performance MRC compound document codec is presented, where the layer decomposition scheme is matched to the JPEG encoder with arithmetic coding for the foreground and background image layers. JBIG is used to code the mask layer. Integrated noise removal routines enable handling of scanned documents along with electronic ones. Resolution scalable decoding features are also implemented. The page segmenter yields a segmentation mask, which serves to separate text and other features.

Original languageEnglish (US)
Title of host publicationIEEE International Conference on Image Processing
Pages434-437
Number of pages4
Volume3
StatePublished - 2001
EventIEEE International Conference on Image Processing (ICIP) - Thessaloniki, Greece
Duration: Oct 7 2001Oct 10 2001

Other

OtherIEEE International Conference on Image Processing (ICIP)
Country/TerritoryGreece
CityThessaloniki
Period10/7/0110/10/01

ASJC Scopus subject areas

  • Computer Vision and Pattern Recognition
  • Hardware and Architecture
  • Electrical and Electronic Engineering

Fingerprint

Dive into the research topics of 'JPEG-matched MRC compression of compound documents'. Together they form a unique fingerprint.

Cite this