LoADPart: Load-Aware Dynamic Partition of Deep Neural Networks for Edge Offloading

doi:10.1109/ICDCS54860.2022.00053

UM > THE STATE KEY LABORATORY OF INTERNET OF THINGS FOR SMART CITY (UNIVERSITY OF MACAU)

Residential College	false
Status	已發表Published
	LoADPart: Load-Aware Dynamic Partition of Deep Neural Networks for Edge Offloading
	Liu, Hongzhou 1; Zheng, Wenli 1; Li, Li2 ; Guo, Minyi 1
	2022
Conference Name	2022 IEEE 42nd International Conference on Distributed Computing Systems (ICDCS)
Source Publication	Proceedings - International Conference on Distributed Computing Systems
Volume	2022-July
Pages	481-491
Conference Date	2022/07/10-2022/07/13
Conference Place	Bologna, Italy
Publisher	IEEE COMPUTER SOC, 10662 LOS VAQUEROS CIRCLE, PO BOX 3014, LOS ALAMITOS, CA 90720-1264 USA
Abstract	The emerging edge computing technique provides support for the computation tasks that are delay-sensitive and compute-intensive, such as deep neural network inference, by offloading them from a user-end device to an edge server for fast execution. The increasing offloaded tasks on an edge server are gradually facing the contention of both the network and computation resources. The existing offloading approaches often partition the deep neural network at a place where the amount of data transmission is small to save network resource, but rarely consider the problem caused by computation resource shortage on the edge server. In this paper, we design LoADPart, a deep neural network offloading system. LoADPart can dynamically and jointly analyze both the available network bandwidth and the computation load of the edge server, and make proper decisions of deep neural network partition with a light-weighted algorithm, to minimize the end-to-end inference latency. We implement LoADPart for MindSpore, a deep learning framework supporting edge AI, and compare it with state-of-the-art solutions in the experiments on 6 deep neural networks. The results show that under the variation of server computation load, LoADPart can reduce the end-to-end latency by 14.2% on average and up to 32.3% in some specific cases.
Keyword	Computation Offloading Deep Neural Networks Inference
DOI	10.1109/ICDCS54860.2022.00053
URL	View the original
Indexed By	CPCI-S
Language	英語English
WOS Research Area	Computer Science
WOS Subject	Computer Science, Hardware & Architecture ; Computer Science, Software Engineering ; Computer Science, Theory & Methods
WOS ID	WOS:000877026100044
Scopus ID	2-s2.0-85140918122
Fulltext Access	View Full-Text via DOI View Full-Text via Web of Science View Full-Text via Scopus
Citation statistics
Document Type	Conference paper
Collection	THE STATE KEY LABORATORY OF INTERNET OF THINGS FOR SMART CITY (UNIVERSITY OF MACAU)
Corresponding Author	Liu, Hongzhou
Affiliation	1.Shanghai Jiao Tong University, School of Electronic Information and Electric Engineering, Shanghai, China 2.University of Macau, State Key Laboratory of IoTSC, Macau, Macao
Recommended Citation GB/T 7714	Liu, Hongzhou,Zheng, Wenli,Li, Li,et al. LoADPart: Load-Aware Dynamic Partition of Deep Neural Networks for Edge Offloading[C]:IEEE COMPUTER SOC, 10662 LOS VAQUEROS CIRCLE, PO BOX 3014, LOS ALAMITOS, CA 90720-1264 USA, 2022, 481-491.
APA	Liu, Hongzhou., Zheng, Wenli., Li, Li., & Guo, Minyi (2022). LoADPart: Load-Aware Dynamic Partition of Deep Neural Networks for Edge Offloading. Proceedings - International Conference on Distributed Computing Systems, 2022-July, 481-491.

Files in This Item:
There are no files associated with this item.

If you have any objections to this item, please fill out the form below and the administrator will contact you as soon as possible.
Content:
Email：	*
Affiliation No.
Verification Code:	Refresh

Any comments and suggestions are welcomed.
Title:	*
Content:
Email：	*
Verification Code:	Refresh