Standard Compatible Efficient Video Coding with Jointly Optimized Neural Wrappers

Yueyu Hu, Chenhao Zhang, Onur G. Guleryuz, Debargha Mukherjee, Yao Wang

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

We present a standard-compatible video coding scheme with end-to-end optimized neural wrapper over standard video codecs that achieves significant rate-distortion (R-D) performance gains and is still efficient in decoding. We train a pair of pre- and post-processor using a differential JPEG proxy. The pre-processor applies a learned transform to the video and downsamples the video by a factor of 2. It generates a bottleneck video to be coded by a standard codec as a YUV sequence. The post-processor takes the decoded bottleneck video, does the inverse transform, and upsamples it to the original resolution. We follow the design in [1] , where we configure downsample using a layer of strided convolution. We optimize the post-processor for efficiency by replacing convolutions with kernel size larger than 1×1 to depth-wise convolutions [2].

Original languageEnglish (US)
Title of host publicationProceedings - DCC 2024
Subtitle of host publication2024 Data Compression Conference
EditorsAli Bilgin, James E. Fowler, Joan Serra-Sagrista, Yan Ye, James A. Storer
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages561
Number of pages1
ISBN (Electronic)9798350385878
DOIs
StatePublished - 2024
Event2024 Data Compression Conference, DCC 2024 - Snowbird, United States
Duration: Mar 19 2024Mar 22 2024

Publication series

NameData Compression Conference Proceedings
ISSN (Print)1068-0314

Conference

Conference2024 Data Compression Conference, DCC 2024
Country/TerritoryUnited States
CitySnowbird
Period3/19/243/22/24

Keywords

  • Efficient Video Coding
  • Neural Network
  • Postprocess
  • Preprocess
  • Video Coding

ASJC Scopus subject areas

  • Computer Networks and Communications

Fingerprint

Dive into the research topics of 'Standard Compatible Efficient Video Coding with Jointly Optimized Neural Wrappers'. Together they form a unique fingerprint.

Cite this