Collaborative Inference for Deep Neural Networks in Edge Environments
Collaborative Inference for Deep Neural Networks in Edge Environments
Meizhao Liu,Yingcheng Gu,7 Autoren,Sheng Zhang
TLDR
This work investigates the optimization problem of collaborative inference in a heterogeneous system and proposes a scheme CIS, i to fill the gap in collaborative inference in heterogeneous edge environments with multiple edge servers, end devices and DNN tasks.
Abstract
Recent advances in deep neural networks (DNNs) have greatly improved the accuracy and universality of various intelligent applications, at the expense of increasing model size and computational demand. Since the resources of end devices are often too limited to deploy a complete DNN model, offloading DNN inference tasks to cloud servers is a common approach to meet this gap. However, due to the limited bandwidth of WAN and the long distance between end devices and cloud servers, this approach may lead to significant data transmission latency. Therefore, device-edge collaborative inference has emerged as a promising paradigm to accelerate the execution of DNN inference tasks where DNN models are partitioned to be sequentially executed in both end devices and edge servers. Nevertheless, collaborative inference in heterogeneous edge environments with multiple edge servers, end devices and DNN tasks has been overlooked in previous research. To fill this gap, we investigate the optimization problem of collaborative inference in a heterogeneous system and propose a scheme CIS, i
