TY - GEN
T1 - Video coding using 3-D dual-tree discrete wavelet transforms
AU - Wang, Beibei
AU - Wang, Yao
AU - Selesnick, Ivan
AU - Vetro, Anthony
PY - 2005
Y1 - 2005
AB - This paper explores the use of a recently introduced 3-D dual-tree discrete wavelet transform (DDWT) for video coding. The 3-D DDWT is an attractive video representation because it isolates motion along different directions in separate subbands. However, it is an overcomplete transform with 8:1 or 4:1 redundancy. Based on the effectiveness of the iterative projection-based noise shaping scheme proposed by Kingsbury in reducing the number of coefficients, and on our prior investigation of the correlation between subbands at the same spatial/temporal location, both in the significance map and in the actual coefficient values, a new video coding scheme using the 3-D DDWT is proposed. The proposed video codec does not require motion compensation and outperforms the 3D SPIHT codec, both objectively and subjectively, even though the 3-D DDWT produces far more raw coefficients than the conventional 3-D DWT. The proposed coder allows full scalability in the spatial, temporal, and quality dimensions.
UR - http://www.scopus.com/inward/record.url?scp=33646786802&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=33646786802&partnerID=8YFLogxK
U2 - 10.1109/ICASSP.2005.1415341
DO - 10.1109/ICASSP.2005.1415341
M3 - Conference contribution
AN - SCOPUS:33646786802
SN - 0780388747
SN - 9780780388741
T3 - ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings
SP - 61
EP - 64
BT - 2005 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP '05 - Proceedings - Image and Multidimensional Signal Processing Multimedia Signal Processing
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 2005 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP '05
Y2 - 18 March 2005 through 23 March 2005
ER -