Residential Collegefalse
Status已發表Published
FlexGen: Efficient On-Demand Generative AI Service with Flexible Diffusion Model in Mobile Edge Networks
Li, Peichun1; Dong, Huanyu1; Qian, Liping2; Zhou, Sheng3; Wu, Yuan4,5
2024
Source PublicationIEEE Transactions on Cognitive Communications and Networking
ISSN2332-7731
Abstract

Generative artificial intelligence (AI) in edge networks has excelled in delivering human-level creative services close to the end users. However, providing customized intelligence services to a wide range of end clients remains challenging due to the diverse demands of edge applications. In this paper, we present FlexGen, an efficient generative AI framework with flexible diffusion models, to tailor the intelligence service for different client-side requests under diverse quality and efficiency constraints. To this end, we first design and train a flexible diffusion model to support quality-and-cost adjustable image synthesis. After that, we focus on the server-side energy minimization problem subject to the quality level of generative service and the client-side latency constraint. We further theoretically characterize the relationship between the width of the diffusion model and the expected quality of the synthetic image. Following that, the decomposition solution is applied to optimize the generative service, where the image synthesis strategy and resource allocation policy are personalized for different client-side requests. Experiments indicate that, compared to existing image generation schemes, our framework can save up to two times energy consumption without sacrificing the quality of the service.

KeywordGenerative Ai Flexible Diffusion Model Resource Management
DOI10.1109/TCCN.2024.3522084
URLView the original
Language英語English
Scopus ID2-s2.0-85213409182
Fulltext Access
Citation statistics
Document TypeJournal article
CollectionDEPARTMENT OF COMPUTER AND INFORMATION SCIENCE
Faculty of Science and Technology
THE STATE KEY LABORATORY OF INTERNET OF THINGS FOR SMART CITY (UNIVERSITY OF MACAU)
Corresponding AuthorWu, Yuan
Affiliation1.State Key Laboratory of Internet of Things for Smart City and Department of Computer and Information Science, University of Macau, Macao, China
2.Zhejiang University of Technology, College of Information Engineering, Hangzhou, 310023, China
3.Tsinghua University, Beijing National Research Center for Information Science and Technology, Department of Electronic Engineering, Beijing, 100190, China
4.University of Macau, State Key Laboratory of Internet of Things for Smart City, The Department of Computer Information Science, Macao, Macao
5.Zhuhai UM Science and Technology Research Institute, Zhuhai, 519301, China
First Author AffilicationUniversity of Macau
Corresponding Author AffilicationUniversity of Macau
Recommended Citation
GB/T 7714
Li, Peichun,Dong, Huanyu,Qian, Liping,et al. FlexGen: Efficient On-Demand Generative AI Service with Flexible Diffusion Model in Mobile Edge Networks[J]. IEEE Transactions on Cognitive Communications and Networking, 2024.
APA Li, Peichun., Dong, Huanyu., Qian, Liping., Zhou, Sheng., & Wu, Yuan (2024). FlexGen: Efficient On-Demand Generative AI Service with Flexible Diffusion Model in Mobile Edge Networks. IEEE Transactions on Cognitive Communications and Networking.
MLA Li, Peichun,et al."FlexGen: Efficient On-Demand Generative AI Service with Flexible Diffusion Model in Mobile Edge Networks".IEEE Transactions on Cognitive Communications and Networking (2024).
Files in This Item:
There are no files associated with this item.
Related Services
Recommend this item
Bookmark
Usage statistics
Export to Endnote
Google Scholar
Similar articles in Google Scholar
[Li, Peichun]'s Articles
[Dong, Huanyu]'s Articles
[Qian, Liping]'s Articles
Baidu academic
Similar articles in Baidu academic
[Li, Peichun]'s Articles
[Dong, Huanyu]'s Articles
[Qian, Liping]'s Articles
Bing Scholar
Similar articles in Bing Scholar
[Li, Peichun]'s Articles
[Dong, Huanyu]'s Articles
[Qian, Liping]'s Articles
Terms of Use
No data!
Social Bookmark/Share
All comments (0)
No comment.
 

Items in the repository are protected by copyright, with all rights reserved, unless otherwise indicated.