Reconciling High Accuracy, Cost-Efficiency, and Low Latency of Inference Serving Systems M Salmani, S Ghafouri, A Sanaee, K Razavi, M Mühlhäuser, J Doyle, ... Proceedings of the 3rd Workshop on Machine Learning and Systems, 78-86, 2023 | 10 | 2023 |
[Solution] IPA: Inference Pipeline Adaptation to achieve high accuracy and cost-efficiency S Ghafouri, K Razavi, M Salmani, A Sanaee, TL Botran, L Wang, J Doyle, ... Journal of Systems Research 4 (1), 2024 | 2* | 2024 |