DjVu: Analyzing and compressing scanned documents for Internet distribution

Patrick Haffner, Léon Bottou, Paul G. Howard, Yann LeCun

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

DjVu is an image compression technique specifically geared towards the compression of scanned documents in color at high resolution. Typical color magazine pages scanned at 300 dpi are compressed to between 40 and 80 kBytes, or 5 to 10 times smaller than with JPEG for a similar level of subjective quality. The foreground layer, which contains the text and drawings and requires high spatial resolution, is separated from the background layer, which contains pictures and backgrounds and requires less resolution. The foreground is compressed with a bi-tonal image compression technique that takes advantage of character shape similarities. The background is compressed with a new progressive, wavelet-based compression method. A real-time, memory-efficient version of the decoder is available as a plug-in for popular Web browsers.

Original languageEnglish (US)
Title of host publicationProceedings of the 5th International Conference on Document Analysis and Recognition, ICDAR 1999
PublisherIEEE Computer Society
Pages629-632
Number of pages4
ISBN (Electronic)0769503187
DOIs
StatePublished - 1999
Event5th International Conference on Document Analysis and Recognition, ICDAR 1999 - Bangalore, India
Duration: Sep 20 1999Sep 22 1999

Publication series

NameProceedings of the International Conference on Document Analysis and Recognition, ICDAR
ISSN (Print)1520-5363

Other

Other5th International Conference on Document Analysis and Recognition, ICDAR 1999
Country/TerritoryIndia
CityBangalore
Period9/20/999/22/99

ASJC Scopus subject areas

  • Computer Vision and Pattern Recognition

Fingerprint

Dive into the research topics of 'DjVu: Analyzing and compressing scanned documents for Internet distribution'. Together they form a unique fingerprint.

Cite this