TY - GEN
T1 - Feature Compression for Rate Constrained Object Detection on the Edge
AU - Yuan, Zhongzheng
AU - Rawlekar, Samyak
AU - Garg, Siddharth
AU - Erkip, Elza
AU - Wang, Yao
N1 - Publisher Copyright:
© 2022 IEEE.
PY - 2022
Y1 - 2022
N2 - Recent advances in computer vision has led to a growth of interest in deploying visual analytics model on mobile devices. However, most mobile devices have limited computing power, which prohibits them from running large scale visual analytics neural networks. An emerging approach to solve this problem is to offload the computation of these neural networks to computing resources at an edge server. Efficient computation offloading requires optimizing the trade-off between multiple objectives including com-pressed data rate, analytics performance, and computation speed. In this work, we consider a 'split computation' system to offload a part of the computation of the YOLO object detection model. We propose a learnable feature compression approach to compress the intermediate YOLO features with light-weight computation. We train the feature compression and decompression module together with the YOLO model to optimize the object detection accuracy under a rate constraint. Compared to baseline methods that apply either standard image compression or learned image compression at the mobile and perform image de-compression and YOLO at the edge, the proposed system achieves higher detection accuracy at the low to medium rate range. Furthermore, the proposed system requires sub-stantially lower computation time on the mobile device with CPU only.
AB - Recent advances in computer vision has led to a growth of interest in deploying visual analytics model on mobile devices. However, most mobile devices have limited computing power, which prohibits them from running large scale visual analytics neural networks. An emerging approach to solve this problem is to offload the computation of these neural networks to computing resources at an edge server. Efficient computation offloading requires optimizing the trade-off between multiple objectives including com-pressed data rate, analytics performance, and computation speed. In this work, we consider a 'split computation' system to offload a part of the computation of the YOLO object detection model. We propose a learnable feature compression approach to compress the intermediate YOLO features with light-weight computation. We train the feature compression and decompression module together with the YOLO model to optimize the object detection accuracy under a rate constraint. Compared to baseline methods that apply either standard image compression or learned image compression at the mobile and perform image de-compression and YOLO at the edge, the proposed system achieves higher detection accuracy at the low to medium rate range. Furthermore, the proposed system requires sub-stantially lower computation time on the mobile device with CPU only.
KW - computation offloading
KW - data compression
KW - object detection
KW - split computing
UR - http://www.scopus.com/inward/record.url?scp=85139028718&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85139028718&partnerID=8YFLogxK
U2 - 10.1109/MIPR54900.2022.00008
DO - 10.1109/MIPR54900.2022.00008
M3 - Conference contribution
AN - SCOPUS:85139028718
T3 - Proceedings - 5th International Conference on Multimedia Information Processing and Retrieval, MIPR 2022
SP - 1
EP - 6
BT - Proceedings - 5th International Conference on Multimedia Information Processing and Retrieval, MIPR 2022
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 5th International Conference on Multimedia Information Processing and Retrieval, MIPR 2022
Y2 - 2 August 2022 through 4 August 2022
ER -