Planaria: Dynamic architecture fission for spatial multi-tenant acceleration of deep neural networks

S Ghodrati, BH Ahn, JK Kim, S Kinzer… - 2020 53rd Annual …, 2020 - ieeexplore.ieee.org
Deep Neural Networks (DNNs) have reinvigorated real-world applications that rely on
learning patterns of data and are permeating into different industries and markets. Cloud …

Grandslam: Guaranteeing slas for jobs in microservices execution frameworks

RS Kannan, L Subramanian, A Raju, J Ahn… - Proceedings of the …, 2019 - dl.acm.org
The microservice architecture has dramatically reduced user effort in adopting and
maintaining servers by providing a catalog of functions as services that can be used as …

A comprehensive review of AI applications in Automated Container Orchestration, Predictive maintenance, security and compliance, resource optimization, and …

V Bandari - International Journal of Intelligent Automation …, 2021 - research.tensorgate.org
Artificial intelligence (AI) is a rapidly growing field with numerous applications, and
containerization is one area where AI can play a significant role. This research discusses …

Spock: Exploiting serverless functions for slo and cost aware resource procurement in public cloud

JR Gunasekaran, P Thinakaran… - 2019 IEEE 12th …, 2019 - ieeexplore.ieee.org
We are witnessing the emergence of elastic web services which are hosted in public cloud
infrastructures. For reasons of cost-effectiveness, it is crucial for the elasticity of these web …

Kube-knots: Resource harvesting through dynamic container orchestration in gpu-based datacenters

P Thinakaran, JR Gunasekaran… - … on cluster computing …, 2019 - ieeexplore.ieee.org
Compute heterogeneity is increasingly gaining prominence in modern datacenters due to
the addition of accelerators like GPUs and FPGAs. We observe that datacenter schedulers …

Knowledge representation of the state of a cloud-native application

J Kosińska, G Brotoń, M Tobiasz - International Journal on Software Tools …, 2024 - Springer
Cloud Computing has revolutionized the way applications are developed, deployed, and
maintained. Over the past decade, we have observed dynamically growing interest in Cloud …

Mocha: Multinode cost optimization in heterogeneous clouds with accelerators

P Zhou, J Sheng, CH Yu, P Wei, J Wang, D Wu… - The 2021 ACM/SIGDA …, 2021 - dl.acm.org
FPGAs have been widely deployed in public clouds, eg, Amazon Web Services (AWS) and
Huawei Cloud. However, simply offloading accelerated kernels from CPU hosts to PCIe …

Understanding energy efficiency in iot app executions

S Zhao, PV Rengasamy, H Zhang… - 2019 IEEE 39th …, 2019 - ieeexplore.ieee.org
Billions of Internet-of-Things (IoT) devices such as sensors, actuators, computing units, etc.,
are connected to form IoT platforms. However, it is observed that such hardware platforms …

Caliper: Interference estimator for multi-tenant environments sharing architectural resources

RS Kannan, M Laurenzano, J Ahn, J Mars… - ACM Transactions on …, 2019 - dl.acm.org
We introduce Caliper, a technique for accurately estimating performance interference
occurring in shared servers. Caliper overcomes the limitations of prior approaches by …

Quality aware batch scheduling of containers in cloud computing environment

SA Poojitha, K Ravindranath - International Journal of Information …, 2025 - Springer
The proposed scheduling method for cloud computing called “quality-aware batch
scheduling of containers (QABSC)” gives priority to timely completion of tasks that are …