Split Computing With Scalable Feature Compression for Visual Analytics on the Edge

Zhongzheng Yuan, Samyak Rawlekar, Siddharth Garg, Elza Erkip, Yao Wang

Research output: Contribution to journal › Article › peer-review


Running deep visual analytics models for real-time applications is challenging for mobile devices. Offloading the computation to an edge server can mitigate the computation bottleneck at the mobile device, but may decrease analytics performance because the image data must be compressed. We consider a “split computing” system that offloads part of the deep learning model's computation, and introduce a novel learned feature compression approach with lightweight computation. We demonstrate the effectiveness of the split computing pipeline in performing computation offloading for object detection and image classification. Compared to compressing the raw images at the mobile device and running the analytics model on the decompressed images at the server, the proposed feature-compression approach can achieve significantly higher analytics performance at the same bit rate, while reducing the complexity at the mobile device. We further propose a scalable feature compression approach, which facilitates adaptation to network bandwidth dynamics while performing comparably to the non-scalable approach.
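The split-computing idea in the abstract can be illustrated with a toy sketch. It is not the authors' method: the learned feature compressor is replaced here by plain uniform scalar quantization, and the "scalable" part is approximated by a coarse base layer plus an optional residual enhancement layer. All function names (`head`, `tail`, `encode_scalable`, `decode_scalable`) and parameters are hypothetical and chosen only for illustration.

```python
import random

def head(image):
    """Hypothetical mobile-side part: pairwise averaging stands in for early layers."""
    return [(image[i] + image[i + 1]) / 2.0 for i in range(0, len(image) - 1, 2)]

def tail(features):
    """Hypothetical server-side part: a mean stands in for the rest of the model."""
    return sum(features) / len(features)

def quantize(values, step):
    """Uniform scalar quantization to integer indices (assumed stand-in for learned coding)."""
    return [round(v / step) for v in values]

def dequantize(indices, step):
    return [q * step for q in indices]

def encode_scalable(features, base_step, refine_levels=4):
    """Return a coarse base layer and an enhancement layer coding the residual."""
    base = quantize(features, base_step)
    coarse = dequantize(base, base_step)
    residual = [f - c for f, c in zip(features, coarse)]
    enh = quantize(residual, base_step / refine_levels)
    return base, enh

def decode_scalable(base, enh, base_step, refine_levels=4):
    """Decode from the base layer alone, or refine it when the enhancement layer arrived."""
    recon = dequantize(base, base_step)
    if enh is not None:
        enh_step = base_step / refine_levels
        recon = [r + e * enh_step for r, e in zip(recon, enh)]
    return recon

# Mobile side: run the head and encode the features.
random.seed(0)
image = [random.random() for _ in range(16)]
feats = head(image)
base, enh = encode_scalable(feats, base_step=0.1)

# Server side: low bandwidth uses the base layer only, high bandwidth uses both.
low = decode_scalable(base, None, base_step=0.1)
high = decode_scalable(base, enh, base_step=0.1)
err_low = max(abs(a - b) for a, b in zip(feats, low))
err_high = max(abs(a - b) for a, b in zip(feats, high))
score_low, score_high = tail(low), tail(high)
```

In this sketch the sender always produces both layers, and the receiver decodes whatever arrives. That is one simple way to adapt to bandwidth changes without re-encoding. The actual system presumably learns the compressor jointly with the analytics task, which this toy does not attempt.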

Original language: English (US)
Pages (from-to): 1-13
Number of pages: 13
Journal: IEEE Transactions on Multimedia
State: Accepted/In press - 2024


  • Analytical models
  • Bit rate
  • Computational modeling
  • Computer Vision
  • Feature Compression
  • Image coding
  • Object Detection
  • Performance evaluation
  • Servers
  • Split Computing
  • Task analysis

ASJC Scopus subject areas

  • Signal Processing
  • Media Technology
  • Computer Science Applications
  • Electrical and Electronic Engineering

