Residential College | false |
Status | 已發表Published |
LoADPart: Load-Aware Dynamic Partition of Deep Neural Networks for Edge Offloading | |
Liu, Hongzhou1; Zheng, Wenli1; Li, Li2; Guo, Minyi1 | |
2022 | |
Conference Name | 2022 IEEE 42nd International Conference on Distributed Computing Systems (ICDCS) |
Source Publication | Proceedings - International Conference on Distributed Computing Systems |
Volume | 2022-July |
Pages | 481-491 |
Conference Date | 2022/07/10-2022/07/13 |
Conference Place | Bologna, Italy |
Publisher | IEEE COMPUTER SOC, 10662 LOS VAQUEROS CIRCLE, PO BOX 3014, LOS ALAMITOS, CA 90720-1264 USA |
Abstract | The emerging edge computing technique provides support for the computation tasks that are delay-sensitive and compute-intensive, such as deep neural network inference, by offloading them from a user-end device to an edge server for fast execution. The increasing offloaded tasks on an edge server are gradually facing the contention of both the network and computation resources. The existing offloading approaches often partition the deep neural network at a place where the amount of data transmission is small to save network resource, but rarely consider the problem caused by computation resource shortage on the edge server. In this paper, we design LoADPart, a deep neural network offloading system. LoADPart can dynamically and jointly analyze both the available network bandwidth and the computation load of the edge server, and make proper decisions of deep neural network partition with a light-weighted algorithm, to minimize the end-to-end inference latency. We implement LoADPart for MindSpore, a deep learning framework supporting edge AI, and compare it with state-of-the-art solutions in the experiments on 6 deep neural networks. The results show that under the variation of server computation load, LoADPart can reduce the end-to-end latency by 14.2% on average and up to 32.3% in some specific cases. |
Keyword | Computation Offloading Deep Neural Networks Inference |
DOI | 10.1109/ICDCS54860.2022.00053 |
URL | View the original |
Indexed By | CPCI-S |
Language | 英語English |
WOS Research Area | Computer Science |
WOS Subject | Computer Science, Hardware & Architecture ; Computer Science, Software Engineering ; Computer Science, Theory & Methods |
WOS ID | WOS:000877026100044 |
Scopus ID | 2-s2.0-85140918122 |
Fulltext Access | |
Citation statistics | |
Document Type | Conference paper |
Collection | THE STATE KEY LABORATORY OF INTERNET OF THINGS FOR SMART CITY (UNIVERSITY OF MACAU) |
Corresponding Author | Liu, Hongzhou |
Affiliation | 1.Shanghai Jiao Tong University, School of Electronic Information and Electric Engineering, Shanghai, China 2.University of Macau, State Key Laboratory of IoTSC, Macau, Macao |
Recommended Citation GB/T 7714 | Liu, Hongzhou,Zheng, Wenli,Li, Li,et al. LoADPart: Load-Aware Dynamic Partition of Deep Neural Networks for Edge Offloading[C]:IEEE COMPUTER SOC, 10662 LOS VAQUEROS CIRCLE, PO BOX 3014, LOS ALAMITOS, CA 90720-1264 USA, 2022, 481-491. |
APA | Liu, Hongzhou., Zheng, Wenli., Li, Li., & Guo, Minyi (2022). LoADPart: Load-Aware Dynamic Partition of Deep Neural Networks for Edge Offloading. Proceedings - International Conference on Distributed Computing Systems, 2022-July, 481-491. |
Files in This Item: | There are no files associated with this item. |
Items in the repository are protected by copyright, with all rights reserved, unless otherwise indicated.
Edit Comment