TY - JOUR
T1 - Video coding using 3D dual-tree wavelet transform
AU - Wang, Beibei
AU - Wang, Yao
AU - Selesnick, Ivan
AU - Vetro, Anthony
PY - 2007
Y1 - 2007
N2 - This work investigates the use of the 3D dual-tree discrete wavelet transform (DDWT) for video coding. The 3D DDWT is an attractive video representation because it isolates image patterns with different spatial orientations and motion directions and speeds in separate subbands. However, it is an overcomplete transform with 4 : 1 redundancy when only real parts are used. We apply the noise-shaping algorithm proposed by Kingsbury to reduce the number of coefficients. To code the remaining significant coefficients, we propose two video codecs. The first one applies separate 3D set partitioning in hierarchical trees (SPIHT) on each subset of the DDWT coefficients (each forming a standard isotropic tree). The second codec exploits the correlation between redundant subbands, and codes the subbands jointly. Both codecs do not require motion compensation and provide better performance than the 3D SPIHT codec using the standard DWT, both objectively and subjectively. Furthermore, both codecs provide full scalability in spatial, temporal, and quality dimensions. Besides the standard isotropic decomposition, we propose an anisotropic DDWT, which extends the superiority of the normal DDWT with more directional subbands without adding to the redundancy. This anisotropic structure requires significantly fewer coefficients to represent a video after noise shaping.Finally, we also explore the benefits of combining the 3D DDWT with the standard DWT to capture a wider set of orientations.
AB - This work investigates the use of the 3D dual-tree discrete wavelet transform (DDWT) for video coding. The 3D DDWT is an attractive video representation because it isolates image patterns with different spatial orientations and motion directions and speeds in separate subbands. However, it is an overcomplete transform with 4 : 1 redundancy when only real parts are used. We apply the noise-shaping algorithm proposed by Kingsbury to reduce the number of coefficients. To code the remaining significant coefficients, we propose two video codecs. The first one applies separate 3D set partitioning in hierarchical trees (SPIHT) on each subset of the DDWT coefficients (each forming a standard isotropic tree). The second codec exploits the correlation between redundant subbands, and codes the subbands jointly. Both codecs do not require motion compensation and provide better performance than the 3D SPIHT codec using the standard DWT, both objectively and subjectively. Furthermore, both codecs provide full scalability in spatial, temporal, and quality dimensions. Besides the standard isotropic decomposition, we propose an anisotropic DDWT, which extends the superiority of the normal DDWT with more directional subbands without adding to the redundancy. This anisotropic structure requires significantly fewer coefficients to represent a video after noise shaping.Finally, we also explore the benefits of combining the 3D DDWT with the standard DWT to capture a wider set of orientations.
UR - http://www.scopus.com/inward/record.url?scp=34247220373&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=34247220373&partnerID=8YFLogxK
U2 - 10.1155/2007/42761
DO - 10.1155/2007/42761
M3 - Article
AN - SCOPUS:34247220373
SN - 1687-5176
VL - 2007
JO - Eurasip Journal on Image and Video Processing
JF - Eurasip Journal on Image and Video Processing
M1 - 42761
ER -