Deep learning workload scheduling in gpu datacenters: Taxonomy, challenges and vision
Deep learning (DL) shows its prosperity in a wide variety of fields. The development of a DL
model is a time-consuming and resource-intensive procedure. Hence, dedicated GPU …
model is a time-consuming and resource-intensive procedure. Hence, dedicated GPU …
Retiarii: A deep learning {Exploratory-Training} framework
Traditional deep learning frameworks such as TensorFlow and PyTorch support training on
a single deep neural network (DNN) model, which involves computing the weights iteratively …
a single deep neural network (DNN) model, which involves computing the weights iteratively …
Nautilus: An optimized system for deep transfer learning over evolving training datasets
S Nakandala, A Kumar - … of the 2022 International Conference on …, 2022 - dl.acm.org
Deep learning (DL) has revolutionized unstructured data analytics. But in most cases, DL
needs massive labeled datasets and large compute clusters, which hinders its adoption …
needs massive labeled datasets and large compute clusters, which hinders its adoption …
Horizontally fused training array: An effective hardware utilization squeezer for training novel deep learning models
Driven by the tremendous effort in researching novel deep learning (DL) algorithms, the
training cost of developing new models increases staggeringly in recent years. We analyze …
training cost of developing new models increases staggeringly in recent years. We analyze …
Saturn: An Optimized Data System for Large Model Deep Learning Workloads
K Nagrecha, A Kumar - arXiv preprint arXiv:2309.01226, 2023 - arxiv.org
Large language models such as GPT-3 & ChatGPT have transformed deep learning (DL),
powering applications that have captured the public's imagination. These models are rapidly …
powering applications that have captured the public's imagination. These models are rapidly …
Efficient supernet training using path parallelism
Compared to conventional neural networks, training a supernet for Neural Architecture
Search (NAS) is very time consuming. Although current works have demonstrated that …
Search (NAS) is very time consuming. Although current works have demonstrated that …
Declarative data serving: the future of machine learning inference on the edge
T Shaowang, N Jain, DD Matthews… - Proceedings of the VLDB …, 2021 - dl.acm.org
Recent advances in computer architecture and networking have ushered in a new age of
edge computing, where computation is placed close to the point of data collection to …
edge computing, where computation is placed close to the point of data collection to …
EdgeNN: Efficient Neural Network Inference for CPU-GPU Integrated Edge Devices
C Zhang, F Zhang, K Chen, M Chen… - 2023 IEEE 39th …, 2023 - ieeexplore.ieee.org
With the development of the architectures and the growth of AIoT application requirements,
data processing on edge has become popular. Neural network inference is widely employed …
data processing on edge has become popular. Neural network inference is widely employed …
Rotary: A Resource Arbitration Framework for Progressive Iterative Analytics
Increasingly modern computing applications employ progressive iterative analytics, as best
exemplified by two prevalent cases, approximate query processing (AQP) and deep learning …
exemplified by two prevalent cases, approximate query processing (AQP) and deep learning …
Riveter: Adaptive Query Suspension and Resumption Framework for Cloud Native Databases
In modern cloud environments, ephemeral resources with intermittent availability and
fluctuating monetary costs are becoming common. This dynamic nature presents a new …
fluctuating monetary costs are becoming common. This dynamic nature presents a new …