UELLM: A Unified and Efficient Approach for Large Language Model Inference Serving Conference paper
He, Yiyuan, Xu, Minxian, Wu, Jingfeng, Zheng, Wanyi, Ye, Kejiang, Xu, Chengzhong. UELLM: A Unified and Efficient Approach for Large Language Model Inference Serving[C]:Springer Science and Business Media Deutschland GmbH, 2025, 218-235.
Authors:  He, Yiyuan;  Xu, Minxian;  Wu, Jingfeng;  Zheng, Wanyi;  Ye, Kejiang; et al.
TC[Scopus]:0 | Submit date:2025/01/22
Cloud Computing  Large Language Model Inference  Resource Management  Scheduling Algorithm  
LSRAM: A Lightweight Autoscaling and SLO Resource Allocation Framework for Microservices Based on Gradient Descent Journal article
Hu, Kan, Xu, Minxian, Ye, Kejiang, Xu, Chengzhong. LSRAM: A Lightweight Autoscaling and SLO Resource Allocation Framework for Microservices Based on Gradient Descent[J]. Software - Practice and Experience, 2024.
Authors:  Hu, Kan;  Xu, Minxian;  Ye, Kejiang;  Xu, Chengzhong
TC[WOS]:0 TC[Scopus]:0  IF:2.6/2.5 | Submit date:2024/12/26
Gradient Descent  Lightweight  Microservices  Resource Autoscaling  SLO Allocation  
DRPC: Distributed Reinforcement Learning Approach for Scalable Resource Provisioning in Container-based Clusters Journal article
Bai, Haoyu, Xu, Minxian, Ye, Kejiang, Buyya, Rajkumar, Xu, Chengzhong. DRPC: Distributed Reinforcement Learning Approach for Scalable Resource Provisioning in Container-based Clusters[J]. IEEE Transactions on Services Computing, 2024, 1-12.
Authors:  Bai, Haoyu;  Xu, Minxian;  Ye, Kejiang;  Buyya, Rajkumar;  Xu, Chengzhong
TC[WOS]:0 TC[Scopus]:0  IF:5.5/5.9 | Submit date:2024/08/05
Cloud Computing  Distributed Resources Management  Reinforcement Learning  Kubernetes  Microservice  
Practice of Alibaba cloud on elastic resource provisioning for large-scale microservices cluster Journal article
Xu, Minxian, Yang, Lei, Wang, Yang, Gao, Chengxi, Wen, Linfeng, Xu, Guoyao, Zhang, Liping, Ye, Kejiang, Xu, Chengzhong. Practice of Alibaba cloud on elastic resource provisioning for large-scale microservices cluster[J]. Software - Practice and Experience, 2024, 54(1), 39-57.
Authors:  Xu, Minxian;  Yang, Lei;  Wang, Yang;  Gao, Chengxi;  Wen, Linfeng; et al.
TC[WOS]:7 TC[Scopus]:6  IF:2.6/2.5 | Submit date:2024/02/22
Alibaba  Cloud-native  Latency  Microservice  Resource Provisioning  
Efficient Multi-Task Computation Offloading Game for Mobile Edge Computing Journal article
Chu, Shuhui, Gao, Chengxi, Xu, Minxian, Ye, Kejiang, Xiao, Zhu, Xu, Chengzhong. Efficient Multi-Task Computation Offloading Game for Mobile Edge Computing[J]. IEEE Transactions on Services Computing, 2024, 17(1), 30-46.
Authors:  Chu, Shuhui;  Gao, Chengxi;  Xu, Minxian;  Ye, Kejiang;  Xiao, Zhu; et al.
TC[WOS]:5 TC[Scopus]:7  IF:5.5/5.9 | Submit date:2024/02/22
Computation Offloading  Mobile Edge Computing  Multi-task  Nash Equilibrium  Potential Games  
ChainsFormer: A Chain Latency-Aware Resource Provisioning Approach for Microservices Cluster Conference paper
Song, Chenghao, Xu, Minxian, Ye, Kejiang, Wu, Huaming, Gill, Sukhpal Singh, Buyya, Rajkumar, Xu, Chengzhong. ChainsFormer: A Chain Latency-Aware Resource Provisioning Approach for Microservices Cluster[C], 2023, 197-211.
Authors:  Song, Chenghao;  Xu, Minxian;  Ye, Kejiang;  Wu, Huaming;  Gill, Sukhpal Singh; et al.
TC[WOS]:4 TC[Scopus]:4 | Submit date:2024/02/22
Chain  Kubernetes  Microservice  Reinforcement Learning  Scaling  
Flash: Joint Flow Scheduling and Congestion Control in Data Center Networks Journal article
Gao, Chengxi, Chu, Shuhui, Xu, Hong, Xu, Minxian, Ye, Kejiang, Xu, Chengzhong. Flash: Joint Flow Scheduling and Congestion Control in Data Center Networks[J]. IEEE Transactions on Cloud Computing, 2023, 11(1), 1038-1049.
Authors:  Gao, Chengxi;  Chu, Shuhui;  Xu, Hong;  Xu, Minxian;  Ye, Kejiang; et al.
TC[WOS]:11 TC[Scopus]:10  IF:5.3/4.6 | Submit date:2022/05/13
Cloud Computing  Congestion Control  Data Center Networking  Flow Scheduling  
CoScal: Multifaceted Scaling of Microservices With Reinforcement Learning Journal article
Xu, Minxian, Song, Chenghao, Ilager, Shashikant, Gill, Sukhpal Singh, Zhao, Juanjuan, Ye, Kejiang, Xu, Chengzhong. CoScal: Multifaceted Scaling of Microservices With Reinforcement Learning[J]. IEEE Transactions on Network and Service Management, 2022, 19(4), 3995-4009.
Authors:  Xu, Minxian;  Song, Chenghao;  Ilager, Shashikant;  Gill, Sukhpal Singh;  Zhao, Juanjuan; et al.
TC[WOS]:29 TC[Scopus]:39  IF:4.7/4.6 | Submit date:2023/08/07
Cloud Computing  Workload Prediction  Microservices  Reinforcement Learning  Brownout  Scalability  
CoScal: Multi-faceted Scaling of Microservices with Reinforcement Learning Journal article
Xu, Minxian, Song, Chenghao, Ilager, Shashikant, Gill, Sukhpal Singh, Zhao, Juanjuan, Ye, Kejiang, Xu, Chengzhong. CoScal: Multi-faceted Scaling of Microservices with Reinforcement Learning[J]. IEEE Transactions on Network and Service Management, 2022, 19(4), 3995-4009.
Authors:  Xu, Minxian;  Song, Chenghao;  Ilager, Shashikant;  Gill, Sukhpal Singh;  Zhao, Juanjuan; et al.
TC[WOS]:29 TC[Scopus]:39  IF:4.7/4.6 | Submit date:2023/01/30
Cloud Computing  Workload Prediction  Microservices  Reinforcement Learning  Brownout  Scalability  
Machine Learning-based Orchestration of Containers: A Taxonomy and Future Directions Journal article
Zhong, Zhiheng, Xu, Minxian, Rodriguez, Maria Alejandra, Xu, Chengzhong, Buyya, Rajkumar. Machine Learning-based Orchestration of Containers: A Taxonomy and Future Directions[J]. ACM Computing Surveys, 2022, 54(10).
Authors:  Zhong, Zhiheng;  Xu, Minxian;  Rodriguez, Maria Alejandra;  Xu, Chengzhong;  Buyya, Rajkumar
TC[WOS]:51 TC[Scopus]:73  IF:23.8/21.1 | Submit date:2023/08/07
Container Orchestration  Machine Learning  Cloud Computing  Resource Provisioning  Systematic Review