Residential College | false |
Status | 已發表Published |
FlexGen: Efficient On-Demand Generative AI Service with Flexible Diffusion Model in Mobile Edge Networks | |
Li, Peichun1; Dong, Huanyu1; Qian, Liping2; Zhou, Sheng3; Wu, Yuan4,5![]() ![]() | |
2024 | |
Source Publication | IEEE Transactions on Cognitive Communications and Networking
![]() |
ISSN | 2332-7731 |
Abstract | Generative artificial intelligence (AI) in edge networks has excelled in delivering human-level creative services close to the end users. However, providing customized intelligence services to a wide range of end clients remains challenging due to the diverse demands of edge applications. In this paper, we present FlexGen, an efficient generative AI framework with flexible diffusion models, to tailor the intelligence service for different client-side requests under diverse quality and efficiency constraints. To this end, we first design and train a flexible diffusion model to support quality-and-cost adjustable image synthesis. After that, we focus on the server-side energy minimization problem subject to the quality level of generative service and the client-side latency constraint. We further theoretically characterize the relationship between the width of the diffusion model and the expected quality of the synthetic image. Following that, the decomposition solution is applied to optimize the generative service, where the image synthesis strategy and resource allocation policy are personalized for different client-side requests. Experiments indicate that, compared to existing image generation schemes, our framework can save up to two times energy consumption without sacrificing the quality of the service. |
Keyword | Generative Ai Flexible Diffusion Model Resource Management |
DOI | 10.1109/TCCN.2024.3522084 |
URL | View the original |
Language | 英語English |
Scopus ID | 2-s2.0-85213409182 |
Fulltext Access | |
Citation statistics | |
Document Type | Journal article |
Collection | DEPARTMENT OF COMPUTER AND INFORMATION SCIENCE Faculty of Science and Technology THE STATE KEY LABORATORY OF INTERNET OF THINGS FOR SMART CITY (UNIVERSITY OF MACAU) |
Corresponding Author | Wu, Yuan |
Affiliation | 1.State Key Laboratory of Internet of Things for Smart City and Department of Computer and Information Science, University of Macau, Macao, China 2.Zhejiang University of Technology, College of Information Engineering, Hangzhou, 310023, China 3.Tsinghua University, Beijing National Research Center for Information Science and Technology, Department of Electronic Engineering, Beijing, 100190, China 4.University of Macau, State Key Laboratory of Internet of Things for Smart City, The Department of Computer Information Science, Macao, Macao 5.Zhuhai UM Science and Technology Research Institute, Zhuhai, 519301, China |
First Author Affilication | University of Macau |
Corresponding Author Affilication | University of Macau |
Recommended Citation GB/T 7714 | Li, Peichun,Dong, Huanyu,Qian, Liping,et al. FlexGen: Efficient On-Demand Generative AI Service with Flexible Diffusion Model in Mobile Edge Networks[J]. IEEE Transactions on Cognitive Communications and Networking, 2024. |
APA | Li, Peichun., Dong, Huanyu., Qian, Liping., Zhou, Sheng., & Wu, Yuan (2024). FlexGen: Efficient On-Demand Generative AI Service with Flexible Diffusion Model in Mobile Edge Networks. IEEE Transactions on Cognitive Communications and Networking. |
MLA | Li, Peichun,et al."FlexGen: Efficient On-Demand Generative AI Service with Flexible Diffusion Model in Mobile Edge Networks".IEEE Transactions on Cognitive Communications and Networking (2024). |
Files in This Item: | There are no files associated with this item. |
Items in the repository are protected by copyright, with all rights reserved, unless otherwise indicated.
Edit Comment