TY - JOUR
T1 - Image coding using dual-tree discrete wavelet transform
AU - Yang, Jingyu
AU - Wang, Yao
AU - Xu, Wenli
AU - Dai, Qionghai
N1 - Funding Information:
Yao Wang (M’90–SM’98–F’04) received the B.S. and M.S. degrees in electronic engineering from Tsinghua University, Beijing, China, in 1983 and 1985, respectively, and the Ph.D. degree in electrical and computer engineering from University of Cali-fornia, Santa Barbara, in 1990. Since 1990, she has been with the faculty of Poly-technic University, Brooklyn, NY, and is presently a Professor of electrical and computer engineering. She was on sabbatical leave from Princeton Univer-sity, Princeton, NJ, in 1998, and from the Thomson Corporate Research, Princeton, from 2004–2005. She was a Consultant with AT&T Laboratories-Research, formerly AT&T Bell Laboratories, from 1992 to 2000. Her research areas include video communications, multimedia signal processing, and medical imaging. She is the leading author of a textbook titled Video Processing and Communications (Englewood Cliffs, NJ: Prentice-Hall, 2002) and has published over 150 papers in journals and conference proceedings. Dr. Wang has served as an Associate Editor for the IEEE TRANSACTIONS ON MULTIMEDIA and the IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY. She received the New York City Mayor’s Award for Excellence in Science and Technology in the Young Investigator Category in year 2000. She was elected Fellow of the IEEE in 2004 for contributions to video processing and communications. She is a co-winner of the IEEE Communications Society Leonard G. Abraham Prize Paper Award in the Field of Communications Systems in 2004. She received the Oversea Outstanding Young Investigator Award from the National Natural Science Foundation of China (NSFC) in 2005 and the Yangtze River Scholar Award from the Ministry of Education of China in 2008.
Funding Information:
Manuscript received November 26, 2007; revised April 23, 2008. First published July 9, 2008; last published August 13, 2008 (projected). This work was supported in part by the Joint Research Fund for Overseas Chinese Young Scholars of NSFC (Grant 60528004), in part by the Distinguished Young Scholars of NSFC (Grant 60525111), and in part by the key project of NSFC (Grant 60432030). The associate editor coordinating the review of this manuscript and approving it for publication was Prof. Kai-Kuang Ma.
PY - 2008
Y1 - 2008
N2 - In this paper, we explore the application of 2-D dual-tree discrete wavelet transform (DDWT), which is a directional and redundant transform, for image coding. Three methods for sparsifying DDWT coefficients, i.e., matching pursuit, basis pursuit, and noise shaping, are compared. We found that noise shaping achieves the best nonlinear approximation efficiency with the lowest computational complexity. The interscale, intersubband, and intrasubband dependency among the DDWT coefficients are analyzed. Three subband coding methods, i.e., SPIHT, EBCOT, and TCE, are evaluated for coding DDWT coefficients. Experimental results show that TCE has the best performance. In spite of the redundancy of the transform, our DDWT_TCE scheme outperforms JPEG2000 up to 0.70 dB at low bit rates and is comparable to JPEG2000 at high bit rates. The DDWT_TCE scheme also outperforms two other image coders that are based on directional filter banks. To further improve coding efficiency, we extend the DDWT to an anisotropic dual-tree discrete wavelet packets (ADDWP), which incorporates adaptive and anisotropic decomposition into DDWT. The ADDWP subbands are coded with TCE coder. Experimental results show that ADDWP_TCE provides up to 1.47 dB improvement over the DDWT_TCE scheme, outperforming JPEG2000 up to 2.00 dB. Reconstructed images of our coding schemes are visually more appealing compared with DWT-based coding schemes thanks to the directionality of wavelets.
AB - In this paper, we explore the application of 2-D dual-tree discrete wavelet transform (DDWT), which is a directional and redundant transform, for image coding. Three methods for sparsifying DDWT coefficients, i.e., matching pursuit, basis pursuit, and noise shaping, are compared. We found that noise shaping achieves the best nonlinear approximation efficiency with the lowest computational complexity. The interscale, intersubband, and intrasubband dependency among the DDWT coefficients are analyzed. Three subband coding methods, i.e., SPIHT, EBCOT, and TCE, are evaluated for coding DDWT coefficients. Experimental results show that TCE has the best performance. In spite of the redundancy of the transform, our DDWT_TCE scheme outperforms JPEG2000 up to 0.70 dB at low bit rates and is comparable to JPEG2000 at high bit rates. The DDWT_TCE scheme also outperforms two other image coders that are based on directional filter banks. To further improve coding efficiency, we extend the DDWT to an anisotropic dual-tree discrete wavelet packets (ADDWP), which incorporates adaptive and anisotropic decomposition into DDWT. The ADDWP subbands are coded with TCE coder. Experimental results show that ADDWP_TCE provides up to 1.47 dB improvement over the DDWT_TCE scheme, outperforming JPEG2000 up to 2.00 dB. Reconstructed images of our coding schemes are visually more appealing compared with DWT-based coding schemes thanks to the directionality of wavelets.
KW - Anisotropic decomposition
KW - Image coding
KW - Redundant transform
KW - Sparse representation
KW - Wavelet transform
UR - http://www.scopus.com/inward/record.url?scp=50549099323&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=50549099323&partnerID=8YFLogxK
U2 - 10.1109/TIP.2008.926159
DO - 10.1109/TIP.2008.926159
M3 - Article
C2 - 18701394
AN - SCOPUS:50549099323
SN - 1057-7149
VL - 17
SP - 1555
EP - 1569
JO - IEEE Transactions on Image Processing
JF - IEEE Transactions on Image Processing
IS - 9
ER -