Residential College | false |
Status | 已發表Published |
Multi-Tier Workload Consolidations in the Cloud: Profiling, Modeling and Optimization | |
Ye, Kejiang1; Shen, Haiying2; Wang, Yang1; Xu, Cheng Zhong3 | |
2022 | |
Source Publication | IEEE Transactions on Cloud Computing |
ISSN | 2168-7161 |
Volume | 10Issue:2Pages:899-912 |
Abstract | Reducing tail latency becomes increasingly important to improve the user-perceived service experience. User-facing latency-sensitive cloud applications typically contain multiple interactive tiers (e.g., Web, App, Database) running in different virtual machines (VMs) with complex interaction patterns. However, such interactions between VMs in different tiers are often neglected in previous VM consolidation methods, resulting in poor application performance. In this article, we study the consolidation of multi-tier interactive workloads from a new perspective of user-perceived tail latency. We propose a novel profiling-based consolidation methodology to satisfy tail latency requirements while reducing the number of used physical machines. To achieve such a goal, we first perform large-scale profiling experiments under various consolidation settings in a KVM virtualized private cluster to establish the empirical performance values. We consider two key factors that affect the tail latency of multi-tier workloads: interference with co-located VMs and interaction between tiers. We model the consolidation of multi-tier workloads as an optimization problem with different objectives and constraints, and derive the consolidation schedule. We implement and evaluate the proposed models, as well as comparing with other methods (i.e., without profiling or without considering interaction influence). Extensive experimental results show that the proposed method is able to reduce up to 5X tail latency, compared with the method without profiling and up to 1.3X tail latency, compared with the method without considering the interaction influence between different tiers. |
Keyword | Cloud Computing Multi-tier Workload Server Consolidation Tail Latency |
DOI | 10.1109/TCC.2020.2975788 |
URL | View the original |
Indexed By | SCIE |
Language | 英語English |
WOS Research Area | Computer Science |
WOS Subject | Computer Science, Information Systems ; Computer Science, Software Engineering ; Computer Science, Theory & Methods |
WOS ID | WOS:000808079500012 |
Scopus ID | 2-s2.0-85132218964 |
Fulltext Access | |
Citation statistics | |
Document Type | Journal article |
Collection | Faculty of Science and Technology |
Corresponding Author | Ye, Kejiang |
Affiliation | 1.Shenzhen Institutes of Advanced Technology, Chinese Academy of Sciences, Shenzhen, Guangdong, 518055, China 2.University of Virginia, Department of Computer Science, Charlottesville, 22904, United States 3.University of Macau, Faculty of Science and Technology, Taipa, Macao |
Recommended Citation GB/T 7714 | Ye, Kejiang,Shen, Haiying,Wang, Yang,et al. Multi-Tier Workload Consolidations in the Cloud: Profiling, Modeling and Optimization[J]. IEEE Transactions on Cloud Computing, 2022, 10(2), 899-912. |
APA | Ye, Kejiang., Shen, Haiying., Wang, Yang., & Xu, Cheng Zhong (2022). Multi-Tier Workload Consolidations in the Cloud: Profiling, Modeling and Optimization. IEEE Transactions on Cloud Computing, 10(2), 899-912. |
MLA | Ye, Kejiang,et al."Multi-Tier Workload Consolidations in the Cloud: Profiling, Modeling and Optimization".IEEE Transactions on Cloud Computing 10.2(2022):899-912. |
Files in This Item: | There are no files associated with this item. |
Items in the repository are protected by copyright, with all rights reserved, unless otherwise indicated.
Edit Comment