Adaptive Distributed Convolutional Neural Network Inference at the Network Edge with ADCNN

Sai Qian Zhang, Jieyu Lin, Qi Zhang

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

The emergence of the Internet of Things (IoT) has led to a remarkable increase in the volume of data generated at the network edge. In order to support real-time smart IoT applications, massive amounts of data generated from edge devices need to be processed using methods such as deep neural networks (DNNs) with low latency. To improve application performance and minimize resource cost, enterprises have begun to adopt Edge computing, a computation paradigm that advocates processing input data locally at the network edge. However, as edge nodes are often resource-constrained, running data-intensive DNN inference tasks on each individual edge node often incurs high latency, which seriously limits the practicality and effectiveness of this model. In this paper, we study the problem of distributed execution of inference tasks on edge clusters for Convolutional Neural Networks (CNNs), one of the most prominent models of DNN. Unlike previous work, we present Fully Decomposable Spatial Partition (FDSP), which naturally supports resource heterogeneity and dynamicity in edge computing environments. We then present a compression technique that further reduces network communication overhead. Our system, called ADCNN, provides up to 2.8 × speed up compared to state-of-the-art approaches, while achieving a competitive inference accuracy.

Original languageEnglish (US)
Title of host publicationProceedings of the 49th International Conference on Parallel Processing, ICPP 2020
PublisherAssociation for Computing Machinery
ISBN (Electronic)9781450388160
DOIs
StatePublished - Aug 17 2020
Event49th International Conference on Parallel Processing, ICPP 2020 - Virtual, Online, Canada
Duration: Aug 17 2020Aug 20 2020

Publication series

NameACM International Conference Proceeding Series

Conference

Conference49th International Conference on Parallel Processing, ICPP 2020
Country/TerritoryCanada
CityVirtual, Online
Period8/17/208/20/20

Keywords

  • Convolutional neural networks
  • Distributed inference
  • Edge computing

ASJC Scopus subject areas

  • Software
  • Human-Computer Interaction
  • Computer Vision and Pattern Recognition
  • Computer Networks and Communications

Fingerprint

Dive into the research topics of 'Adaptive Distributed Convolutional Neural Network Inference at the Network Edge with ADCNN'. Together they form a unique fingerprint.

Cite this