{MLaaS} in the Wild: Workload Analysis and Scheduling in {Large-Scale} Heterogeneous {GPU} Clusters Q Weng, W Xiao, Y Yu, W Wang, C Wang, J He, Y Li, L Zhang, W Lin, ... 19th USENIX Symposium on Networked Systems Design and Implementation (NSDI …, 2022 | 205 | 2022 |
Joint subcarrier and CPU time allocation for mobile edge computing Y Yu, J Zhang, KB Letaief 2016 IEEE Global Communications Conference (GLOBECOM), 1-6, 2016 | 189 | 2016 |
LRC: dependency-aware cache management for data analytics clusters Y Yu, W Wang, J Zhang, KB Letaief IEEE INFOCOM 2017-IEEE Conference on Computer Communications, 1-9, 2017 | 64 | 2017 |
Morphling: Fast, Near-Optimal Auto-Configuration for Cloud-Native Model Serving L Wang, L Yang, Y Yu, W Wang, B Li, X Sun, J He, L Zhang Proceedings of the ACM Symposium on Cloud Computing, 639-653, 2021 | 40 | 2021 |
Flow-level QoE of video streaming in wireless networks Y Xu, SE Elayoubi, E Altman, R El-Azouzi, Y Yu IEEE Transactions on Mobile Computing 15 (11), 2762-2780, 2015 | 26 | 2015 |
George: Learning to place long-lived containers in large clusters with operation constraints S Li, L Wang, W Wang, Y Yu, B Li Proceedings of the ACM Symposium on Cloud Computing, 258-272, 2021 | 24 | 2021 |
SP-cache: load-balanced, redundancy-free cluster caching with selective partition Y Yu, R Huang, W Wang, J Zhang, KB Letaief SC18: International Conference for High Performance Computing, Networking …, 2018 | 22 | 2018 |
Beware of Fragmentation: Scheduling {GPU-Sharing} Workloads with Fragmentation Gradient Descent Q Weng, L Yang, Y Yu, W Wang, X Tang, G Yang, L Zhang 2023 USENIX Annual Technical Conference (USENIX ATC 23), 995-1008, 2023 | 19 | 2023 |
Opus: Fair and efficient cache sharing for in-memory data analytics Y Yu, W Wang, J Zhang, Q Weng, KB Letaief 2018 IEEE 38th International Conference on Distributed Computing Systems …, 2018 | 15 | 2018 |
LERC: coordinated cache management for data-parallel systems Y Yu, W Wang, J Zhang, KB Letaief GLOBECOM 2017-2017 IEEE Global Communications Conference, 1-6, 2017 | 15 | 2017 |
Workload consolidation in alibaba clusters: the good, the bad, and the ugly Y Zhang, Y Yu, W Wang, Q Chen, J Wu, Z Zhang, J Zhong, T Ding, ... Proceedings of the 13th Symposium on Cloud Computing, 210-225, 2022 | 12 | 2022 |
Achieving load-balanced, redundancy-free cluster caching with selective partition Y Yu, W Wang, R Huang, J Zhang, KB Letaief IEEE Transactions on Parallel and Distributed Systems 31 (2), 439-454, 2019 | 9 | 2019 |
RepBun: Load-Balanced, Shuffle-Free Cluster Caching for Structured Data M Yu, Y Yu, Y Zheng, B Yang, W Wang IEEE INFOCOM 2020-IEEE Conference on Computer Communications, 954-963, 2020 | 4 | 2020 |
Towards Dependency-Aware Cache Management for Data Analytics Applications Y Yu, C Zhang, W Wang, J Zhang, K Letaief IEEE Transactions on Cloud Computing, 2019 | 4 | 2019 |
LACS: Load-Aware Cache Sharing with Isolation Guarantee Y Yu, W Wang, J Zhang, KB Letaief 2019 IEEE 39th International Conference on Distributed Computing Systems …, 2019 | 4 | 2019 |