Residential Collegefalse
Status已發表Published
LoADPart: Load-Aware Dynamic Partition of Deep Neural Networks for Edge Offloading
Liu, Hongzhou1; Zheng, Wenli1; Li, Li2; Guo, Minyi1
2022
Conference Name2022 IEEE 42nd International Conference on Distributed Computing Systems (ICDCS)
Source PublicationProceedings - International Conference on Distributed Computing Systems
Volume2022-July
Pages481-491
Conference Date2022/07/10-2022/07/13
Conference PlaceBologna, Italy
PublisherIEEE COMPUTER SOC, 10662 LOS VAQUEROS CIRCLE, PO BOX 3014, LOS ALAMITOS, CA 90720-1264 USA
Abstract

The emerging edge computing technique provides support for the computation tasks that are delay-sensitive and compute-intensive, such as deep neural network inference, by offloading them from a user-end device to an edge server for fast execution. The increasing offloaded tasks on an edge server are gradually facing the contention of both the network and computation resources. The existing offloading approaches often partition the deep neural network at a place where the amount of data transmission is small to save network resource, but rarely consider the problem caused by computation resource shortage on the edge server. In this paper, we design LoADPart, a deep neural network offloading system. LoADPart can dynamically and jointly analyze both the available network bandwidth and the computation load of the edge server, and make proper decisions of deep neural network partition with a light-weighted algorithm, to minimize the end-to-end inference latency. We implement LoADPart for MindSpore, a deep learning framework supporting edge AI, and compare it with state-of-the-art solutions in the experiments on 6 deep neural networks. The results show that under the variation of server computation load, LoADPart can reduce the end-to-end latency by 14.2% on average and up to 32.3% in some specific cases.

KeywordComputation Offloading Deep Neural Networks Inference
DOI10.1109/ICDCS54860.2022.00053
URLView the original
Indexed ByCPCI-S
Language英語English
WOS Research AreaComputer Science
WOS SubjectComputer Science, Hardware & Architecture ; Computer Science, Software Engineering ; Computer Science, Theory & Methods
WOS IDWOS:000877026100044
Scopus ID2-s2.0-85140918122
Fulltext Access
Citation statistics
Document TypeConference paper
CollectionTHE STATE KEY LABORATORY OF INTERNET OF THINGS FOR SMART CITY (UNIVERSITY OF MACAU)
Corresponding AuthorLiu, Hongzhou
Affiliation1.Shanghai Jiao Tong University, School of Electronic Information and Electric Engineering, Shanghai, China
2.University of Macau, State Key Laboratory of IoTSC, Macau, Macao
Recommended Citation
GB/T 7714
Liu, Hongzhou,Zheng, Wenli,Li, Li,et al. LoADPart: Load-Aware Dynamic Partition of Deep Neural Networks for Edge Offloading[C]:IEEE COMPUTER SOC, 10662 LOS VAQUEROS CIRCLE, PO BOX 3014, LOS ALAMITOS, CA 90720-1264 USA, 2022, 481-491.
APA Liu, Hongzhou., Zheng, Wenli., Li, Li., & Guo, Minyi (2022). LoADPart: Load-Aware Dynamic Partition of Deep Neural Networks for Edge Offloading. Proceedings - International Conference on Distributed Computing Systems, 2022-July, 481-491.
Files in This Item:
There are no files associated with this item.
Related Services
Recommend this item
Bookmark
Usage statistics
Export to Endnote
Google Scholar
Similar articles in Google Scholar
[Liu, Hongzhou]'s Articles
[Zheng, Wenli]'s Articles
[Li, Li]'s Articles
Baidu academic
Similar articles in Baidu academic
[Liu, Hongzhou]'s Articles
[Zheng, Wenli]'s Articles
[Li, Li]'s Articles
Bing Scholar
Similar articles in Bing Scholar
[Liu, Hongzhou]'s Articles
[Zheng, Wenli]'s Articles
[Li, Li]'s Articles
Terms of Use
No data!
Social Bookmark/Share
All comments (0)
No comment.
 

Items in the repository are protected by copyright, with all rights reserved, unless otherwise indicated.