INFless: a native serverless system for low-latency, high-throughput inference Y Yang, L Zhao, Y Li, H Zhang, J Li, M Zhao, X Chen, K Li Proceedings of the 27th ACM International Conference on Architectural …, 2022 | 54 | 2022 |
Rhythm: component-distinguishable workload deployment in datacenters L Zhao, Y Yang, K Zhang, X Zhou, T Qiu, K Li, Y Bao Proceedings of the Fifteenth European Conference on Computer Systems, 1-17, 2020 | 48 | 2020 |
Understanding, predicting and scheduling serverless workloads under partial interference L Zhao, Y Yang, Y Li, X Zhou, K Li Proceedings of the International conference for high performance computing …, 2021 | 37 | 2021 |
Tetris: Memory-efficient serverless inference through tensor sharing J Li, L Zhao, Y Yang, K Zhan, K Li 2022 USENIX Annual Technical Conference (USENIX ATC 22), 2022 | 32 | 2022 |
Optimizing geo-distributed data analytics with coordinated task scheduling and routing L Zhao, Y Yang, A Munir, AX Liu, Y Li, W Qu IEEE Transactions on Parallel and Distributed Systems 31 (2), 279-293, 2019 | 31 | 2019 |
Elax: Provisioning resource elastically for containerized online cloud services Y Yang, L Zhao, Z Li, L Nie, P Chen, K Li 2019 IEEE 21st International Conference on High Performance Computing and …, 2019 | 5 | 2019 |
Experience-availability analysis of online cloud services using stochastic models Y Cao, L Zhao, R Zhang, Y Yang, X Zhou, K Li 2018 IFIP Networking Conference (IFIP Networking) and Workshops, 1-9, 2018 | 4 | 2018 |
Component-distinguishable Co-location and Resource Reclamation for High-throughput Computing L Zhao, Y Cui, Y Yang, X Zhou, T Qiu, K Li, Y Bao ACM Transactions on Computer Systems 42 (1-2), 1-37, 2024 | | 2024 |
Computer Systems S Luo, K Ye, G Xu, L Zhang, G Yang, H Xu, C Xu, L Zhao, Y Cui, Y Yang, ... ACM Transactions on 42 (1-2), 2024 | | 2024 |
Rethinking Deployment for Serverless Functions: A Performance-First Perspective Y Li, L Zhao, Y Yang, W Qu Proceedings of the International Conference for High Performance Computing …, 2023 | | 2023 |
Flame: A Centralized Cache Controller for Serverless Computing Y Yang, L Zhao, Y Li, S Wu, Y Hao, Y Ma, K Li Proceedings of the 28th ACM International Conference on Architectural …, 2023 | | 2023 |
TailCmp-A Tail Latency Evaluation Solution of Public Cloud and Labeled von Neumann Architecture based Cloud Prototype X Kong, X Gao, S Pan, Y Zhou, Y Yang, L Zhao, H Qi 2022 IEEE Intl Conf on Parallel & Distributed Processing with Applications …, 2022 | | 2022 |
SDCBench: A Benchmark Suite for Workload Colocation and Evaluation in Datacenters Y Yang, X Kong, L Zhao, Y Li, H Zhang, J Li, H Qi, K Li Intelligent Computing, 2022 | | 2022 |
Research Article SDCBench: A Benchmark Suite for Workload Colocation and Evaluation in Datacenters Y Yang, X Kong, L Zhao, Y Li, H Zhang, J Li, H Qi, K Li | | 2022 |
A Stochastic Model for Analyzing Tail Latency of Multi-Tier Online Cloud Services R Zhang, Y Yang, L Zhao, X Zhou, B Cai, K Li 2018 9th International Symposium on Parallel Architectures, Algorithms and …, 2018 | | 2018 |